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Abstract 

We review the tests of general relativity that will become possible with space-based gravita- 
tional-wave detectors operating in the ~ 10 -5 -l Hz low-frequency band. The fundamental 
aspects of gravitation that can be tested include the presence of additional gravitational fields 
other than the metric; the number and tensorial nature of gravitational-wave polarization 
states; the velocity of propagation of gravitational waves; the binding energy and gravitational- 
wave radiation of binaries, and therefore the time evolution of binary inspirals; the strength 
and shape of the waves emitted from binary mergers and ringdowns; the true nature of as- 
trophysical black holes; and much more. The strength of this science alone calls for the swift 
implementation of a space-based detector; the remarkable richness of astrophysics, astronomy, 
and cosmology in the low-frequency gravitational-wave band make the case even stronger. 
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1 Introduction 



The first direct detection of gravitational waves (GWs), widely expected in the mid 2010s with 
advanced ground-based interferometers [1801 1374] . will represent the culmination of a fifty-year 
experimental quest [55]. Soon thereafter, newly plentiful GW observations will begin to shed 
light on the structure, populations, and astrophysics of mostly dark, highly relativistic objects 
such as black holes and neutron stars. In the low-frequency band that will be targeted by space- 
based detectors (roughly 10~ 5 to 1 Hz), GW observations will provide a census of the massive 
black-hole binaries at the center of galaxies, and characterize their merger histories; probe the 
Galactic population of binaries including highly evolved, degenerate stars; study the stellar-mass 
objects that spiral into central black holes in galactic nuclei; and possibly detect stochastic GW 
backgrounds from the dynamical evolution of the very early Universe. 

Thus, there are very strong astrophysical motivations to observe the Universe in GWs, especially 
because the systems and phenomena that can be observed in this fashion are largely orthogonal 
to those accessible to traditional electromagnetic and astroparticle astronomy. The promise of 
GWs appears just as great for fundamental physics. Einstein's theory of gravity, general relativity 
(GR), has been confirmed extensive experimental tests; but these have largely been confined to 
the Solar system, where gravity is well approximated by Newtonian gravity with small corrections. 
By contrast, GWs will provide the first direct observational probe of the dynamical, strong-field 
regime of GR, where the nature and behavior of gravity can be significantly different from the 
Newtonian picture. GWs are prima facie the perfect probe to investigate gravitation, since they 
originate directly from the bulk motion of gravitating matter, relieving the need to understand 
and model the physics of other intermediate messengers, typically photons from stellar surfaces or 
black- hole surroundings. 

Already today we can rely on a very sophisticated understanding of the analytical and numerical 
techniques required to model GW sources and their GW emission, including the post-Newtonian 
expansion [68 , 356J , black- hole perturbation theory |93| , numerical relativity for vacuum |297j and 
"wet" [149] spacetimes, and much more. That these techniques should have been developed so 
far in the absence of a dialogue with experimental data (except for the binary pulsar |241| ) is 
witness to the great perceived promise of GW astronomy. For a bird's-eye view of the field, see the 
Living Review by Sathyaprakash and Schutz [315] . who cover the physics of GWs, the principles 
of operation of GW detectors, the nature of GW sources, the data analysis of GW signals, and the 
science payoffs of GW observations for physics, astrophysics, and cosmology. 

This review focuses on the opportunities to challenge or confirm our understanding of gravi- 
tational physics that will be offered by forthcoming space-based missions to observe GWs in the 
low-frequency band between 10 -5 and 1 Hz. Most of the literature on this subject has focused 
on one mission design, LISA (the Laser Interferometer Space Antenna 353, 210, 298 ]), which was 
studied jointly by NASA and ESA between 2001 and 2011. In 2011, budgetary and programmatic 
reasons led the two agencies to end this partnership, and to pursue space-based GW detection 
separately, studying cheaper, rescoped LISA-like missions. 

ESA's proposed eLISA/NGO [30] would be smaller than LISA, fly on orbits closer to Earth, 
and operate interferometric links only along two arms. In 2012 eLISA was considered for imple- 
mentation as ESA's first large mission ("LI") in the Cosmic Vision program. A planetary mission 
was selected instead, but eLISA will be in the running for the next launch slot ("L2"), with a 
decision coming as soon as 2013. NASA ran studies on a broader range of missions [269] . including 
several variants of LISA to be implemented by NASA alone, as well as options with a geocentric 
orbit (OMEGA [155]), and without drag-free control (LAGRANGE [551]). The final study report 
concludes that scientifically compelling missions can be carried out for less, but not substantially 
less, than the full LISA cost; that scientific performance decreases far more rapidly than cost; and 
that no design choice or technology can make a dramatic reduction in cost without much greater 
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risks. The NASA study noted the possibility of participation in the ESA-led eLISA mission (if 
selected by ESA) as a minority partner. 

Whatever specific design is eventually selected, it is likely that its architecture, technology, and 
scientific reach will bear a strong resemblance to LISA's (with the appropriate scalings in sensitiv- 
ity, mission duration, and so on). Thus, the research reviewed in this article, which was targeted 
in large part to LISA, is still broadly relevant to future missions. Such LISA-like observatories 
are characterized by a few common elements: a set of three spacecraft in long-baseline (Mkm) or- 
bits, monitoring their relative displacements using laser interferometry; drag-free operation (except 
for LAGRANGE |251j), whereas displacement measurements are referenced to freely falling test 
masses protected by the spacecraft, which hover around the masses using precise micro-Newton 
thrusters; frequency correction of laser noise using a variety of means, including onboard cavities 
and interferometers, armlocking, and a LISA-specific technique known as Time-Delay Interferom- 
etry. 

The predictions of GR that can be tested by space-based GW observatories include the absence 
of gravitational fields other than the metric tensor; the number and character of GW polarization 
states; the speed of GW propagation; the detailed progress of binary inspiral, as driven by nonlinear 
gravitational dynamics and loss of energy to GWs; the strength and shape of the GWs from binary 
merger and ringdown; the true nature of astrophysical black holes; and more. 

Some of these tests will also be performed with ground-based GW detectors, but space-based 
tests will almost always have superior accuracy and significance, because low-frequency sources 
are intrinsically stronger, and will spend a larger time within the band of good detector sensitiv- 
ity. For binary systems with very asymmetric mass ratios, such as extreme mass-ratio inspirals 
(EMRIs), LISA-like missions will measure hundred of thousands of orbital cycles; because suc- 
cessful detections require matching the phase of signals throughout their evolution, it follows that 
these observations will be exquisitely sensitive to source parameters. The data-analysis detection 
problem will be correspondingly delicate, but has been tackled both theoretically |279j . and in a 
practical program of mock data challenges for LISA [34, 35, 364 . 

The rest of this review is organized as follows. Section [2] provides the briefest overview of Ein- 
stein's GR, of the theoretical framework in which it can be tested, and of a few leading alternative 
theories. It also introduces the "black-hole paradigm," which augments Einstein's equations with 
a few assumptions of physicality that lead to the prediction that the end result of gravitational 
collapse are black holes described by the Kerr metric. Section [3] reviews the "classic" LISA archi- 
tecture, as well as possible options for LISA-like variants. Section [4] summarizes the main classes 
of GW sources that would be observed by LISA-like detectors, and that can be used to test GR. 
Section [5] examines the tests of gravitational dynamics that can be performed with these sources, 
while section [6] discusses the tests of the black- hole nature and structure. (A conspicuous omission 
are possible stochastic GW backgrounds of cosmological origin [66]; indeed, in this article we do 
not discuss the role of space-based detectors as probes of cosmology and early-Universe physics.) 
Last, section [7] presents our conclusions and speculations. 
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2 The theory of gravitation 



Newton's theory of gravitation provided a description of the effect of gravity through the inverse 
square law without attempting to explain the origin of gravity. The inverse square law provided 
an accurate description of all measured phenomena in the Solar System for more than two hundred 
years, but the first hints that it was not the correct description of gravitation began to appear in 
the late nineteenth century, as a result of the improved precision in measuring phenomena such 
as the perihelion precession of Mercury. Einstein's contribution to our understanding of gravity 
was not only practical but also aesthetic, providing a beautiful explanation of gravity as the 
curvature of spacetime. In developing GR as a generally covariant theory based on a dynamical 
spacetime metric, Einstein sought to extend the principle of relativity to gravitating systems, and 
he built on the crucial 1907 insight that the equality of inertial and gravitational mass allowed the 
identification of inertial systems in homogeneous gravitational fields with uniformly accelerated 
frames — the principle of equivalence [278] . Einstein was also guided by his appreciation of Ricci 
and Levi-Civita's absolute differential calculus (later to become differential geometry), arguably as 
much as by the requirement to reproduce Newtonian gravity in the weak- field limit. Indeed, one 
could say that GR was born "of almost pure thought" [382] . 

Einstein's theory of general relativity is described by the action 



in which g is the determinant of the spacetime metric and R = g^R^ u is the Ricci scalar, where 



= R%u is the Ricci Tensor, R^ s = - T^ >s + T^ x T x ps - T^T^ the Riemann curvature 



tensor, = g aS {g/38,-y + 98"/, p ~ 90j,s) the affine connection, and a comma denotes a partial 
derivative. When coupled to a matter distribution, this action yields the field equations 



where denotes the stress-energy tensor of the matter. 

Since the development of the theory, GR has withstood countless experimental tests [3821 13581 
1359] based on measurements as different as atomic-clock precision |306j . orbital dynamics (most 
notably lunar laser ranging [256] ). astrometry [331] . and relativistic astrophysics (most exquisitely 
the binary pulsar [241, 382 , but not only [302] ). It is therefore the correct and natural benchmark 
against which to compare alternative theories using future observations and we will follow the same 
approach in this article. Unlike in the case of Newtonian gravity at the time that GR was developed, 
there are no current observations that GR cannot explain that can be used to guide development 
of alternative^] Nonetheless, there are crucial aspects of Einstein's theory that have never been 
probed directly, such as its strong-field dynamics and the propagation of field perturbations (GWs). 
Furthermore, it is known that classical GR must ultimately fail at the Planck scale, where quantum 
effects become important, and traces of the quantum nature of gravity may be accessible at lower 
energies [320J . As emphasized by Will [382], GR has no adjustable constants, so every test is 
potentially deadly, and a probe that could reveal new physics. 

2.1 Will's "standard model" of gravitational theories 

Will's Living Review [382 and his older monograph [385] are the fundamental references about 
the experimental verification of GR. In this section, we give only a brief overview of what may 

1 Some have argued that measurement of cosmological dark energy effects may be explained by long-range modifi- 
cations of Einstein's equations. While space-based GW observations may potentially help to refine redshift-distance 
measurements, we generally take the possibility of cosmological scale GR alternatives as falling outside the scope of 
this review. 




(1) 
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be called Will's "standard model" for alternative theories of gravity, which proceeds through four 
steps: a) strong evidence for the equivalence principle supports a metric formulation for gravity; 
b) metric theories are classified according to what gravitational fields (scalar, vector, tensor) they 
prescribe; c) slow-motion, weak-field conservative dynamics are described in a unified parametrized 
post- Newtonian (PPN) formalism, and constrained by experiment and observations; d) finally, 
equations for the slow-motion generation and weak-field propagation of gravitational radiation are 
derived separately for each metric theory, and again compared to observations. Many of the tests 
of gravitational physics envisaged for LISA belong in this last sector of Will's standard model, and 
are discussed in Sec. 5.1 of this review. This scheme however leaves out two other important points 
of contact between gravitational phenomenology and LISA's GW observations: the strong-field, 
non-linear dynamics of black holes and their structure and excitations, especially as probed by 
small orbiting bodies. We will deal with these in Sees. [5]and|6j respectively; but let us first delve 
into Will's standard model. 



The equivalence principle and metric theories of gravitation. Einstein's original intuition 
|277j placed the equivalence principle [183] as a cornerstone for the theories that describe gravity 
as curved spacetime. As formulated by Newton, the principle states simply that inertial and 
gravitational mass are proportional, and therefore all "test" bodies fall with the same acceleration 
(in modern usage, this is known as the weak equivalence principle, or WEP). Dicke later recognized 
that in developing GR Einstein had implicitly posited a broader principle (Einstein's equivalence 
principle, or EEP) that consists of WEP plus local Lorentz invariance and local position invariance: 
that is, of the postulates that the outcome of local non- gravitational experiments is independent 
of, respectively, the velocity and position of the local freely falling reference frames in which the 
experiments are performed. 

Turyshev [358J gives a current review of the experimental verification of WEP (shown to hold 
to parts in 10 13 by differential free-fall tests |319j ). local Lorentz invariance (verified to parts in 
10 22 by clock-anisotropy experiments [228]), and local position invariance (verified to parts in 10 5 
by gravitational-redshift experiments |50j , and to much greater precision when looking for possible 
time variations of fundamental constants |360| ). Although these three parts of EEP appear distinct 
in their experimental consequences, their underlying physics is necessarily related in any theory of 
gravity, so Schiff conjectured (and others argued convincingly) that any complete and self-consistent 
theory of gravity that embodies WEP must realize also EEP 382 . 

EEP leads to metric theories of gravity in which spacetime is represented as a pseudo-Ricman- 
nian manifold, freely-falling test bodies move along the geodesies of its metric, and non-gravitational 
physics is obtained by applying special-relativistic laws in local freely falling frames. GR is, of 
course, a metric theory of gravity; so are scalar-vector-tensor theories such as Brans-Dicke theory, 
which include other gravitational fields in addition to the metric. By contrast, theories with dy- 
namically varying fundamental constants and theories (such as superstring theory) that introduce 
additional WEP-violating gravitational fields |382|, Sec. 2.3] are not metric. Neither are theories 
that provide short-range and long-range modifications to Newton's inverse-square law [BJ. 

The scalar and vector fields in scalar-vector-tensor theories cannot directly affect the motion 
of matter and other non-gravitational fields (which would violate WEP), but they can intervene 
in the generation of gravity and modify its dynamics. These extra fields can be dynamical (i.e., 
determined only in the context of solving for the evolution of everything else) or absolute (i.e., 
assigned a priori to fixed values). The Minkowski metric of special relativity is the classic example 
of absolute field; such fields may be regarded as philosophically unpleasant by those who dislike 
feigning hypotheses, but they have a right of citizenship in modern physics as "frozen in" solutions 
from higher energy scales or from earlier cosmological evolution. 

The additional fields can potentially alter the outcome of local gravitational experiments: while 
the local gravitational effects of different metrics far away can always be erased by describing physics 
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in a freely falling reference frame (which is to say, the local boundary conditions for the metric 
can be arranged to be flat spacetime), the same is not true for scalar and vector fields, which can 
then affect local gravitational dynamics by their interaction with the metric. This amounts to a 
violation not of EEP, but of the strong equivalence principle (SEP), which states that EEP is valid 
also for self-gravitating bodies and gravitational experiments. SEP is verified to parts in 10 4 by 
combined lunar laser- ranging and laboratory experiments [3S8J . So far, GR appears to be the only 
viable metric theory that fully realizes SEP. 

The PPN formalism. Because the experimental consequences of different metric theories follow 
from the specific metric that is generated by matter (possibly with the help of the extra gravi- 
tational fields), and because all these theories must realize Newtonian dynamics in appropriate 
limiting conditions, it is possible to parametrize them in terms of the coefficients of a slow-motion, 
weak-field expansion of the metric. These coefficients appear in front of gravitational potentials 
similar to the Newtonian potential, but involving also matter velocity, internal energy, and pres- 
sure. This scheme is the parametrized-post- Newtonian formalism, pioneered by Nordtvedt and 
extended by Will (see 385 for details). 

Of the ten PPN parameters in the current version of the formalism, two are the celebrated 7 
and j3 (already introduced by Eddington, Robertson, and Schiff for the "classical" tests of GR) that 
rule, respectively, the amount of space curvature produced by unit rest mass and the nonlinearity 
in the superposition of gravitational fields. In GR, 7 and j3 each have the value 1. The other 
eight parameters, if not zero, give origin to violations of position invariance (£), Lorentz invariance 
(ai-3), or even of the conservation of total momentum (0:3, C1-4) an d total angular momentum 

("1-3) Gl-4.)- 

The PPN formalism is sufficiently accurate to describe the tests of gravitation performed in the 
Solar system, as well as many tests using binary-pulsar observations. The parameter 7 is currently 
constrained to 1 ± a few 10 -5 by tests of light delay around massive bodies using the Cassini 
spacecraft [64]; f3 to 1 ± a few f0~ 4 by Lunar laser ranging 388 The other PPN parameters 
have comparable bounds around zero from Solar-system and pulsar measurements, except for 03, 
which is known exceedingly well from pulsar P statistics [382] . 

2.2 Alternative theories 

Tests in the PPN framework have tightly constrained the field of viable alternatives to GR, largely 
excluding theories with absolute elements that give rise to preferred-frame effects [382) . The 
(indirect) observation of GW emission from the binary pulsar and the accurate prediction of its 
P by Einstein's quadrupole formula have definitively excluded other theories }382[ 1337] , Yet more 
GR alternatives were conceived to illuminate points of principle, but they are not well motivated 
physically and therefore are hardly candidates for experimental verification. Some of the theories 
that are still "alive" are described in the following. 

2.2.1 Scalar tensor theories. 

The addition of a single scalar field 4> to GR produces a theory described by the Einstein- frame 
action (see, e.g., |382j ). 

7= (tevrG)- 1 / [R- 2g> MV d fl (pd„cp - V (?)](- c,) 1 ' 2 <Px + I m (^j m , A 2 ^)^), (3) 



2 In this context, laser ranging constrains the Nordtvedt effect (the dependence of free fall for massive bodies 
on gravitational self-energy), which is also a violation of SEP. The j3 constraint given in 388 assumes the Cassini 
result 64 for 7. 
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where g^ is the metric, the Ricci curvature scalar R yields the general-relativistic Einstein-Hilbert 
action, and the two adjacent terms are kinetic and potential energies for the scalar field. Note that 
in the action I m for matter dynamics, the metric couples to matter through the function A(ip), so 
this representation is not manifestly metric; it can however be made so by a change of variables 
that yields the Jordan-frame action, 

I = (167TG)- 1 J[<j>R - ^uWgTdntdvt - ^ 2 V]{-g) 1/2 d A x + I m (^ ro , g^), (4) 

where <fi = A{ip)~ 2 is the transformed scalar field, g^ v = A 2 {ip)g^ v is the physical metric "observed" 
by test bodies, and 3 + 2u)(<p) = [d(ln A(<p)) / d(p]~ 2 . 

The "classic" Brans-Dicke theory corresponds to fixing w to a constant wbd > and it is indistin- 
guishable from GR in the limit cjbd — > oo. In the PPN framework, the only parameter that differs 
from GR is 7 — (1 + cjbd)/ (2 + wbd)- Damour and Esposito-Farese [112] considered an expansion 
of log A(ip) around a cosmological background value, 

log AO) = a (<p-<p ) + ^/3 Q ((p - ipo) 2 H , (5) 

where /?o (and further coefficients) = reproduces Brans-Dicke with a 2 , = 1/(2wbd + 3), f3 > 
causes the evolution of the scalar field toward tpo (and therefore toward GR); and j3o < may 
produce spontaneous scalarization inside objects like neutron stars, leading to large SEP violations. 
These parameters are bound by solar-system, binary-pulsar, and GW observations |113j . 

Scalar-tensor theories have found motivation in string theory and cosmological models, and 
have attracted the most attention in terms of tests with GW observations. 



2.2.2 Vector tensor theories. 

These are obtained by including a dynamical vector field it M coupled to the metric tensor. The 
most general second-order action in such a theory takes the form 



S = I d 4 z ^g [-2A + (1 + u Ull u v )R - K^u^uf v + A(tt M u" + 1) 



16ttG . 

where K£p = ag^ g a p + c 2 S£5p + c 3 S^S^ - C4U^u u g a/3 , (6) 

in which a semicolon denotes covariant differentiation, and the coefficients Cj are arbitrary con- 
stants. There are two types of vector-tensor theories: in unconstrained theories, A = and the 
constant cj is arbitrary, while in Einstein-aether theories the vector field u M is constrained to have 
unit norm, so the Lagrange multiplier A is arbitrary and the constraint allows u) to be absorbed 
into a rescaling of G. For the unconstrained theory, only versions of the theory with C4 = have 
been studied and for these the field equations are |385j 

RnX ~ \RQkX + "0$ + ueg + eO^ + T Q$ + Kg KX = 0, 

0*0 _ „ „,, P , „2 p . _ 1 „ ,„2d_ f„,2\ _i I I 2 



eg = 2u a u [K R x]n - 



«A = u k u x r + u R k\ - ^9k\u R-(u ). k x + g K \O g u 
a - \gK\u a u t3 R a p - {u a u [K ). x]a + ^n g (u K u x ) + \g K \{u a u l3 ). al 3 



T 

eg = -2(F«F Xa - \g KX F a0 F^) 

©( T ) : ct . a 1 a:8 1 I ex. a ;ct\ 

kX = u K . a u\ + u a - K u. x - -g K \u a - t pu < p + (u u [k . x] - u. [k u x] ~ u [K u' x] ). a (7) 
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where F^ = u v . M - u^ v , u 2 = u^, r] = -c 2 , e = -(c 2 + c 3 )/2, and r = -(cj. + c 2 + c 3 ). We use 
the usual subscript notation, such that "q" and "n" denote symmetric and antisymmetric sums. 
In the constrained Einstein-aether theory the held equations are 



J fj,;a C4U U.^ — 

where J a ^ = 

G a fs = 

1 a p — 



rp(u) 87rG matter 

- L a/3 a 1 a/3 



+ [U v jf^ - C 4 U 2 ] U a U/3 - ^g a i3L u 



(8) 



L,, 



•iv u ^a u "-fi i s the aether Lagrangian, and T™ ttcv is the usual matter 



stress-energy tensor [131j . Via field redefinition this theory can be shown to be equivalent to GR 
if ci + C4 = 0, ci + c 2 + c 3 = 0, and c 3 = ±^/c\(ci — 2) [15] • Field redefinition can also be used 
to set ci + C3 = |150j : if this constraint is imposed then equivalence to GR is only achieved if 
the Ci are all zero. This constraint is therefore appropriate to pose Einstein-aether theory as an 
alternative to test against GR, since then any non-zero values of the Cj would represent genuine 
deviations from GR. 

Unconstrained vector-tensor theories were introduced in the 1970s as a straw-man alternative 
to GR |385j , but they have four arbitrary parameters and leave the magnitude of the vector field 
unconstrained, which is a serious defect. Interest in Einstein-aether theories was prompted by the 
desire to construct a covariant theory that violated Lorentz invariance under boosts, by having a 
preferred reference frame — the aether, represented by the vector . The preferred reference frame 
also provides a universal notion of time [166 . Interest in theories that violate Lorentz symmetry 
has recently been revived as a possible window onto aspects of quantum gravity |26j . 



2.2.3 Scalar vector tensor theories. 

The natural extension of scalar-tensor and vector-tensor theories are scalar-vector-tensor theories 
in which the gravitational field is coupled to a vector field and one or more scalar fields. These 
theories are relativistic generalizations of Modified Newtonian Dynamics (MoND) , which was pro- 
posed in order to reproduce observed rotation curves on galactic scales. The relativistic extensions 
were designed also to satisfy cosmological observations on larger scales. The action takes the form 



S 



1 

16^r 



d 4 x 



g{L c 



Lu. + Lr 



(9) 



where Lq = (i? — 2A)/G and L ma ttor are the usual gravitational and matter Lagrangians. There 
are two main versions of the theory, which differ in the choice of the scalar-field and vector-field 
Lagrangians L$ and L u . 

In Tensor- Vector-Scalar gravity (TeVeS) [53j there is one dynamical vector field (f> and a non- 
dynamical vector field a. The Lagrangians are 



Ls = 



1 

2G 
K 
2G 



a 2 (ft* 



u a u p ) 



1 G 



g a0 g^B afi B 0v 



4 



2P 
(ti"u M - 1) 



o^F{Go 2 ) 



(10) 



where F is an unspecified dimensionless function, K is a dimcnsionlcss parameter and B a p — 
up >a — u a ,p. In TeVeS the physical metric that governs the gravitational dynamics of ordinary 
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matter does not coincide with g^, but is determined by the scalar field through 

9p.v = e^g^u - 1UpU v sinh(20). (11) 

An alternative version of TeVeS, called Bi- Scalar-Tensor- Vector gravity (BSTV) has also been 
proposed |314j . in which the scalar field a is allowed to be dynamical. TeVeS is able to explain 
galaxy rotation curves and satisfies constraints from cosmology and gravitational lensing, but stars 
are very unstable [322] and the Bullet Cluster [97 observations (which point to dark matter) cannot 
be explained. 

In Scalar-Tensor- Vector Gravity (STVG) [263] the Lagrangian for the vector field is taken to 

be 

L u =lo [B^B^ - 2fi 2 u^ + V u (u)] , (12) 

with Bp V defined as before. The three constants w, fi, and G that enter this action and the 
gravitational action are then taken to be scalar fields governed by the Lagrangian 



16tt 
Ls = -q 



1 u P ( G. V G. P ^m,p \ M Vg(G) VM 



(13) 



It is claimed that STVG predicts no deviations from GR on the scale of the Solar System or for 
small globular clusters [265) . and that it can reproduce galactic rotation curves |76j . gravitational 
lensing in the Bullet Cluster [77], and a range of cosmological observations [264] . It has been 
proposed that an extension of the ESA-led LISA Pathfinder technology-demonstration mission 
may allow additional constraints on this class of theories |248j . To date the consequences of TeVeS 
or STVG for GW observations has not been investigated. 



2.2.4 Modified-action theories 

f(R) gravity. This theory is derived by replacing R with an arbitrary function f(R) in the 
Einstein -Hubert action. There are two versions of f(R) gravity. In the metric formalism the 
action is extremized with respect to the metric coefficients only, and the connection is taken to be 
the metric connection, depending on the metric components in the standard way. The resulting 
field equations are 

( /.V /.Yx + g^R^RViR) + {-R- K x + g KX nR)f"{R) + R K \f'(R) - \g K xf{R) = 0. (14) 

In the Palatini formalism, the field equations are found by extremizing the action over both the 
metric and the connection. For an f{R) action the resulting equations are 

Rnxf(R) - \g K \f{R) = o, v Q [V=gf(R)g^] = o. (15) 

If the second derivative f"(R) ^ 0, metric f(R) gravity can be shown to be equivalent to a Brans- 
Dicke theory with wbd = 0, while Palatini f(R) gravity is equivalent to a Brans-Dicke theory 
with wbd = — 3/2, with no constraint imposed on f(R) |1441 1335] , In both cases, the Brans-Dicke 
potential depends on the exact functional form f(R). 

f(R) theories have attracted a lot of interest in a cosmological context, since the flexibility in 
choosing the function f(R) allows a wide range of cosmological phenomena to be described [27S\ 185). 
including inflation [338, 373 and late-time acceleration [83, 87J, without violating constraints from 
Big Bang Nucleosynthesis |134j . However, metric f(R) theories are ruled out by Solar System 
measurements if the scalar degree of freedom is assumed to be long-ranged, which results in a PPN 
parameter 7 = 1/2 |95j . excluded by current data. This problem can be avoided by assuming a 
short-range scalar field, but then f(R) theories can only explain the early expansion of the Universe 
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and not late-time acceleration. The Chameleon mechanism |217) has been invoked to circumvent 
this, as it allows the scalar-field mass to be a function of curvature, so that the field can be short 
ranged within the Solar System but long ranged on cosmological scales. 

There are also other issues with f(R) theories. For example, in Palatini f(R) gravity the post- 
Newtonian metric depends on the local matter density 334 , while in metric f(R) gravity with 
f"(R) < there is a Ricci-scalar instability [120] that arises because the effective gravitational 
constant increases with increasing curvature, leading to a runaway instability for small stars. We 
refer the reader to |3351 1115] for more complete reviews of the current understanding of f(R) 
gravity. 



Chern-Simons gravity. Yunes and others H El E2 E3 HQ M 1123 ESS EH [US [2251 12S2 
332, 393, 395] I400| 14031 1405] have recently developed an extensive analysis of the observational 
consequences of Jackiw and Pi's Chern-Simons gravity 208 , which extends the Hilbert action 
with an additional Pontryagin term *RR that is quadratic in the Riemann tensor [174] : 

1= (167TG)- 1 f[R- ^e*RR}(-g) 1 / 2 d i x + I m ^ m ,g fiV ); (16) 

here *RR = *R a b cd R b ac d is built with the help of the dual Riemann tensor *R a b cd = \e cde} R a be ^ 
and it can be expressed as the divergence of the gravitational Chern-Simons topological current r\ 
the scalar field 9 can be treated either as a dynamical quantity, or an absolute field. In both cases, 
*RR vanishes, either dynamically, or as a constraint on acceptable solutions, needed to enforce 
coordinate-invariant matter dynamics, which restricts the space of solutions available to GR. 

Chern-Simons gravity is motivated by string theory and by the attempt to develop a quantum 
theory of gravity satisfying a gauge principle. The Pontryagin term arises in the standard model of 
particle physics as a gauge anomaly: the classical gravitational Noether current that comes from 
the symmetry of the gravitational action is no longer conserved when the theory is quantized, but 
has a divergence proportional to the Pontryagin term. This anomaly can be cancelled by modifying 
the action via the addition of the Chern-Simons Pontryagin term. The same type of correction 
arises naturally in string theory through the Green-Schwarz anomaly-cancelling mechanism, and 
in Loop Quantum Gravity to enforce parity and charge-parity conservation. 

The presence of the Chern-Simons correction leads to parity violation which has various ob- 
servable consequences, with magnitude depending on the Chern-Simons coupling, which string 
theory predicts will be at the Planck scale. If so, these effects will never be observable, but various 
mechanisms have been proposed that could enhance the strength of the Chern-Simons coupling, 
such as non-perturbative instanton corrections |348j . fermion interactions |llj . large intrinsic cur- 
vatures [TU] or small string couplings at late times [380 . For further details on all aspects of 
Chern-Simons gravity, we refer the reader to the review |12j . 



General quadratic gravity. This theory arises by adding to the action all possible terms that 
are quadratic in the Ricci scalar, Ricci tensor, and Riemann tensor. For the action 

S = J V^g (-2A + R + aR 2 + (3R aT R crT + -/R^sR 01 ^ 5 ) d 4 x (17) 

the field equations are 301 

Rk\ ~^R9n\ + aK KX + (3L kX + A.g KA = 0, 

3 In field theory, topological currents are those whose conservation follows not from the equations of motion, but 
from their very geometric construction. 
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K kX = -2R, kX + 2g KX UR - -R 2 g K x + 2RR kX , 

L kX =-2R a K . aX + nR KX + 1 -g KX nR- l -g, x R aT R° T + 2R a K R ljX (18) 



More recently, Stein and Yunes |406j considered a more general form ol quadratic gravity that 
includes the Pontryagin term from Chern-Simons gravity. Their action was 

S = J \f^g{nR + ai h{e)R 2 + a 2 f 2 (9)R ab R ab + a 3 f 3 (9)R abcd R abcd 

+a 4 f 4 (9)R* abcd R abcd ~ t [W a 9W a 9 + 2V(9)} + £ mat J (19) 

in which the cti and /3 are coupling constants, 9 is a scalar field, and £ ma t is the matter Lagrangian 
density. There are two versions of this theory: a non- dynamical version in which the functions 
fi(9) are constants, and a dynamical version in which they are not. 

General quadratic theories are known to exhibit ghost fields — negative mass-norm states that 
violate unitarity (see, for instance, [335j for a discussion and further references). These occur 
generically, although models with an action that is a function only of R and R 2 — AR pv R llv + 
RnvpaR tlvpa only are ghost- free jlOOj . Ghost fields are also present in Chern-Simons modified 
gravity |2671 1125] . which places strong constraints on the parameters of that model. 

2.3 The black-hole paradigm 

The present consensus is that all of the compact objects observed to reside in galactic centers 
are supermassive black holes, described by the Kerr metric of GR [305]. This explanation follows 
naturally in GR from the black-hole uniqueness theorems and from a set of additional assumptions 
of physicality, briefly discussed below. If a deviation from Kerr is inferred from GW observations, 
it would imply that the assumptions are violated, or possibly that GR is not the correct theory 
of gravity. Space-based GW detectors can test "black-hole Kerr-ness" by measuring the GWs 
emitted by smaller compact bodies that move through the gravitational potentials of the central 



objects (see Sec. 6.2 1. Kerr-ness is also tested by characterizing multiple ringdown modes in the 



final black hole resulting from the coalescence of two precusors (see Sec. 6.3 1. Both classes of tests 
probe the static sector of gravity theories (i.e., the occurrence of specific spacetime solutions for 
isolated bodies) rather than their dynamic sector (e.g., the two-body dynamics that ensues from 
the action). 

The current belief that Kerr black holes are ubiquitous follows from work on mathematical 
aspects of GR in the middle of the twentieth century. Oppenheimer and Snyder demonstrated 
that a spherically symmetric, pressure-free distribution would collapse indefinitely to form a black 
hole 283]. This result was assumed to be a curiosity due to spherical symmetry, until it was 
demonstrated by Penrose [284] and by Hawking and Penrose [185] that singularities arise inevitably 
after the formation of a trapped surface during gravitational collapse. Around the same time, it 
was proved that the black- hole solutions of Schwarschild [32 lj and Kerr |215j are the only static 
and axisymmetric black-hole solutions in GR [2071 [833 I307j . These results together indicated the 
inevitability of black-hole formation in gravitational collapse. 

The assumptions that underlie the proof of the uniqueness theorem are that the spacetime is 
a vacuum solution, that it is asymptotically flat, and that it contains an event horizon but no 
closed timelike curves (CTCs) exterior to the horizon [184J. The lack of CTCs is needed to ensure 
causality, while the requirement of a horizon is a consequence of the Cosmic Censorship Hypothesis 
(CCH) [285) . The CCH embodies this belief by stating that any singularity that forms in Nature 
must be hidden behind a horizon (i.e., cannot be naked), and therefore cannot affect the rest of 
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the Universe, which would be undesirable because GR can make no prediction of what happens 
in its vicinity. However, the CCH and the non-existence of CTCs are not required by Einstein's 
equations, and so they could in principle be violated. 

Besides the Kerr metric, we know of many other "black-hole-like" solutions to Einstein's equa- 
tions: these are vacuum solutions with a very compact central object enclosed by a high-redshift 
surface. In fact, any metric can become a solution to Einstein's equation: it is sufficient to insert 
it in the Einstein tensor, and postulate the resulting "matter" stress-energy tensor as an input to 
the equations. However, such matter distributions will not in general satisfy the energy conditions 
(see, e.g., ESU): 

• The weak energy condition is the statement that all timelike observers in a spacetime 
measure a non-negative energy density, T^ v v^v v > 0, for all future-directed timelike vectors 
v 1 *. The null energy condition modifies this condition to null observers by replacing ?/' by 
an arbitrary future-directed null vector k^. 

• The strong energy condition requires the Ricci curvature measured by any timelike 
observer to be non-negative, (7),„ — T^g^ v /2)v^v v > 0, for all timelike v^. 

• The dominant energy condition is the requirement that matter flow along timelike or 
null world lines: that is, that —T^v" be a future-directed timelike or null vector field for any 
future-directed timelike vector v^. 

These conditions make sense on broad physical grounds; but even after imposing them, there 
remain several black-hole-like solutions 342 besides Kerr. 

Thus, space-based GW detectors offer an important test of the "black-hole paradigm" that 
follows from GR plus CCH, CTC non-existence, and the energy conditions. This paradigm is 
especially important: putative black holes are observed to be ubiquitous in the Universe, so their 
true nature has significant implications for our understanding of astrophysics. 

If one or many non-Kerr metrics are found, the hope is that observations will allow us to tease 
apart the various possible explanations: 

1. Does the spacetime contain matter, such as an accretion disk, exterior to the black-hole? 

2. Are the CCH, the no-CTC assumption, or the energy conditions violated? 

3. Is the central object an exotic object, such as a boson star |313| 1216] ? 

4. Is gravity coupled to a Yang-Mills or Skyrme field? This is known to lead to different black- 
hole solutions [2201 13161 1330] . although these are known (for Yang-Mills |343j ) or suspected 
(for Skyrme [123] ) to be unstable to generic perturbations. 

5. Is the theory gravity just different from GR? For instance, in Chern-Simons gravity black 
holes differ from Kerr in their octupole moment |400| , and this correction may produce the 
most significant observational signature in GW observations 332 . 

These questions are challenging, but much will be possible. The tests of black-hole structure with 
space-based GW detectors are discussed in detail in Sec. [6j 
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3 Space-based missions to detect GWs 



The experimental search for gravitational waves began in the 1960s with Joseph Weber's resonant 
bars (and resonant claims [98); it has since grown into an extensive international endeavor that 
has produced a network of km-scale GW interferometers (LIGO g], EGO/VIRGO PQ, GEO600 
[2J, and Tama/KAGRA [3]), the proposals for space-based observatories such as LISA, and the 
effort to detect GWs by using an array of pulsars as reference clocks [192] . 




In this section we briefly describe the architecture of LISA-like space-based GW observatories, 
beginning with the "classic" LISA design, and then discussing the variations studied in the 2011- 
2012 ESA and NASA studies [2011255]. 

3.1 The classic LISA architecture 

The most cited LISA reference is perhaps the 1998 pre-phase A mission study 353 ; a more 
up-to-date review of the LISA technical architecture is given by Jennrich }210j . while the LISA 
science-case document [29 8) describes the state of LISA science at the end of the 2000s. Here we 
give only a quick review of the elements of the mission, referring the reader to those references for 
in-depth discussions. 

LISA principles. LISA consists of three identical cylindrical spacecraft, approximately 3 m wide 
and 1 m high, that are launched together and, after a 14-month cruise, settle into an Earth-like 
heliocentric orbit, 20° behind the Earth. The orbit of each spacecraft is tuned slightly differently, 
resulting in an equilateral-triangle configuration with 5 x 10 6 km arms (commensurate with the 
LISA sensitivity band), inclined by 60° with respect to the ecliptic. This configuration is main- 
tained to 1% by orbital dynamics alone for the lifetime of the mission, nominally 5 years, with a 
goal of 10. In the course of a year, the center of the LISA triangle completes a full revolution on 
the ecliptic, while the triangle itself, as well as its normal, rotate through 360°. 

LISA detects GWs by monitoring the fluctuations in the distances between freely falling refer- 
ence bodies — in LISA, platinum-gold test masses housed and protected by the spacecraft. LISA 
uses three pairs of laser interferometric links to measure inter-spacecraft distances with errors near 
10 pm Hz -1 / 2 . The phase shifts accumulated along different interferometer paths are proportional 
to an integral of GW strain along those trajectories. The test masses suffer from residual acceler- 
ation noise at a level of 3 x 10~ 15 m s~ 2 , which is dominant below 3 mHz. Together, the position 
and acceleration noises (divided by a multiple of the armlength) determine the LISA sensitivity to 
GW strain, which reaches 10 -20 Hz -1 / 2 at frequencies of a few mHz. 

Alternatively (but equivalently) , we may describe LISA as measuring the GW- induced relative 
Doppler shifts between the local lasers on each spacecraft and the remote lasers. These Dopplcr 
shifts are directly proportional to a difference of instantaneous GW strains (as experienced by the 
local spacecraft at the time of measurement, and by the distant spacecraft at a time retarded by 
the LISA armlength divided by the speed of light). In either description, the 1% "breathing" of 
the LISA constellation occurs at frequencies that are safely below the LISA measurement band. 

LISA technology. The distance measurement between the test masses along each arm is in 
fact split in three: an inter-spacecraft measurement between the two optical benches, and two 
local measurements between each test mass and its optical bench. To achieve the inter-spacecraft 
measurement, 2 W of 1064-nm light are sent and received through 40-cm telescopes. The diffracted 
beams deliver only 100 pW of light to the distant spacecrafts, so they cannot be reflected directly; 
instead, the laser phase is measured and transponded back by modulating the frequency of the 
local lasers. Each of the two optical assemblies on each spacecraft include the telescope and a 
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Zerodur optical bench with bonded fused-silica optical components, which implements the inter- 
spacecraft and local interferometers, as well as a few auxiliary interferometric measurements, which 
are needed to monitor the stability of the telescope structure, to control point-ahead corrections, 
and to compare the two lasers on each spacecraft. The LISA phasemeters digitize the signals 
from the optical-bench photodetectors at 50 MHz, and multiplies them with the output of local 
oscillators, computing phase differences and driving the oscillators to track the frequency of the 
measured signal. This heterodyne scheme is needed to handle Doppler shifts (as well as laser 
frequency offsets) as large as 15 MHz. 

Because intrinsic fluctuations in the laser frequencies are indistinguishable from GWs in the 
LISA output, laser frequency noise needs to be suppressed by several orders of magnitude, using a 
hierarchy of techniques |210j : the lasers are prestabilized to local frequency references; arm-locking 
may be used to further stabilize the lasers using the LISA arms (or their differences) as stable 
references; finally Time-Delay Inter ferometry [118] is applied in post processing to remove residual 
laser frequency noise by algebraically combining appropriately delayed single-link measurements, 
in such a way that all laser-frequency-noise terms appear as canceling pairs in the combination. 
With this final step, the LISA measurements combine into synthesized-interferometer observables 
|361j analogous to the readings of ground-based interferometers such as LIGO. 

The LISA disturbance reduction system (DRS) minimizes the deviations of the test masses from 
free-fall trajectories, by shielding them from solar radiation pressure and interplanetary magnetic 
fields. On each spacecraft, the DRS includes two gravitational reference sensors (GRSs): a GRS 
consists of a 2-kg test mass, enclosed in an electrode housing with capacitive reading and control 
of test-mass position and orientation, and accompanied by additional components to cage and 
reposition the test mass, to maintain vacuum, and to control the accumulation of charge. The 
other crucial part of the DRS are the micro-Newton thrusters (on each spacecraft, three clusters 
of four colloid or field-emission-electric propulsion systems), which are controlled in response to 
the GRS readings to maintain the nominal position of the test mass with respect to the spacecraft. 
This is known as drag-free control. The thrusters need to provide up to 100 fj,N force with < 0.1 
/zN noise. In addition to this active correction, test-mass acceleration noise is minimized by the 
accurate knowledge and correction of spacecraft self-gravity, by enforcing magnetic cleanliness, and 
by controlling thermal fluctuations. A version of the LISA DRS with slightly lower performance 
will be flown and tested in LISA Pathfinder [2521 [28] . a single-spacecraft technology precursor 
mission. 

The LISA response to GWs. Compared to ground-based interferometers, the LISA response 
to GWs is both richer and more complex |133l 13771 13621 1108"] . First, the revolution and rotation of 
the LISA constellation imprint a sky-position-dependent signature on long-lasting GW signals (for 
LISA, these include all binary signals): the revolution causes a time-dependent Doppler shift with 
a period of a year, and a fractional amplitude of 10 -4 ; the rotation introduces a 1-year periodicity 
in the LISA equivalent of the ground-based-interferometer antenna patterns |247j . endowing an 
incoming monochromatic GW signal with eight sidebands, separated by yr _1 = 3.17 x 10 -8 Hz. 
These effects are sometimes referred to as the LISA AM and FM modulations. Thus, GW signals 
as measured by LISA carry information regarding the position of the source; on the other hand, 
data analysis must account for these effects, possibly including sky position among the parameters 
of matched-filtering search templates. 

Second, the long-wavelength approximation, whereby the entire "interferometer" shrinks and ex- 
pands as one, cannot be used throughout the LISA band; indeed, the wavelength of GWs meets the 
LISA armlength for a frequency of 60 mHz. As a consequence, the LISA response to a few-second 
impulsive GW is not a single pulse, but a collection of pulses with amplitudes and separations 
dependent on the sky position and polarization of the source; the effect on high-frequency chirping 
signals is more subtle, but still present. This further complicates data analysis, and introduces an 
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independent mechanism (a triangulation of sorts) to localize GW sources of short duration. 

Third, LISA is in effect three detectors in one: this can be understood most easily by considering 
that subsets of the three LISA arms form three separate Michelson-like interferometers (known in 
LISA lingo as X, Y, and Z) at 120° angles. More formally, the LISA interferometric measurements 
can be combined into many different TDI observables, some resembling actual optical setups, other 
quite exotic |361) . although at most three observables are independent in the sense that any other 
observable can be reconstructed by time-delaying and summing a generic basis of three observables 
[367J . Furthermore, such a basis can be chosen so that its components have uncorrelated noises, 
much like widely separated ground-based detectors. This is the case of the celebrated A, E, and T 
observables |299j . By symmetry, one of these (T) must correspond to X + Y + Z; and by symmetry, 
T must be relatively insensitive to GWs in the long- wavelength limit, providing for an independent 
measurement of a combination of instrument noises. 

3.2 LISA-like observatories. 

The mission-concept studies ran in 2011-2012 by ESA [30] and NASA [269] embrace several ap- 
proaches to limiting cost. 

Reducing mass is a broadly useful strategy, because it allows for launch on smaller, cheaper 
rockets, and because mass has been shown to be a good proxy for mission complexity, and therefore 
implementation cost. ESA's NGO design envisages interferometric links along two arms rather than 
LISA's three, resulting in an asymmetric configuration with one full and two "half" spacecraft. As 
a consequence, only one TDI observable can be formed, eliminating the capability of measuring 
two combinations of GW polarizations simultaneously. 

Propellant may be saved by placing the spacecraft closer together (with arms ~ 1-3 Mkm rather 
than 5 Mkm) and closer to Earth. At low frequencies, the effect of shorter armlengths is to reduce 
the response to GWs proportionally; for the same test-mass noise, sensitivity then decreases by 
the same ratio. At high frequencies, the laser power available for position measurement increases 
as L~ 2 , since beams are broadly defocused at millions of kms, improving shot noise, but not other 
optical noises, by a factor L^ 1 (in rms). As a consequence, the sweet spot of the LISA sensitivity 
shifts to higher frequencies, although one may instead plan for a less powerful laser, for further 
cost savings. Spacecraft orbits that are different than LISA's, such as geocentric options, or flat 
configurations that lie in the ecliptic plane, would also alter GW-signal modulations, and therefore 
parameter-estimation performance. The NASA report examines such effects briefly 269 . 

Reducing mission duration also saves money, because it reduces the cost of supporting the 
mission from the ground, and it allows for shorter "warranties" on the various subsystems. Con- 
versely, missions are made cheaper by accepting more risk of failure or underperformance, since 
risk is "retired" by extensive testing and by introducing component redundancy, both of which are 
expensive. The studies also explored replacing LISA's custom subsystems with variants already 
flown on other missions (as in OMEGA [188; ), or eliminating some of them altogether (as in LA- 
GRANGE |251j ). However, the significant performance hit and additional risk incurred by such 
steps is not matched by correspondingly major savings, because the main cost driver for LISA-like 
missions is the necessity of launching and flying three (or more) independent spacecraft. Switching 
to atom interferometry would make for very different mission architectures, but the NASA study 
finds that an atom-interferomer mission would face many of the same cost-driving constraints as 
a laser-interferometer mission [269[ l4Tj . 

Indeed, the overarching conclusion of the NASA study is that no technology can provide dra- 
matic cost reductions, and that scientific performance decreases far more rapidly than cost. Thus, 
"staying the course" and pursuing a modestly descoped LISA-like design, whenever programs and 
budgets will allow it, may yet be the most promising strategy for GW detection in space. 
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Figure 1: The discovery space for space-based GW detectors, covering the low-frequency region 
of the GW spectrum, 10~ 5 Hz < / < 0.1 Hz. The discovery space is delineated by the LISA 
threshold sensitivity curve [231] in black, and by the eLISA sensitivity curve in red [21] . This 
region is populated by a wealth of strong sources, often in large numbers. These systems provide 
ample opportunities for astrophysical tests of GR for gravitational-field strengths that are not well 
characterized and studied in conventional astronomy. 

4 A summary of low-frequency gravitational- wave sources 

The low-frequency GW band, from 10 _5 Hz < / < 10 -1 Hz, is astrophysically appealing because it 
is populated by a large number of sources that are relevant to understanding stellar evolution and 
stellar populations, to probing the growth and evolution of massive black holes and their links to 
galaxies, and possibly to observing the primordial universe. Space-based interferometric detectors 
such as LISA and eLISA are sensitive over this entire band. Figure [l] illustrates the GW strength, 
as a function of GW frequency, for a variety of interesting sources classes. The LISA and eLISA 
baseline sensitivities are overlaid, illustrating the potential reach of interferometric detectors. The 
area above the baseline curve is generically called "discovery space;" the height of a source above 
the curve is a rough estimate of the expected signal-to-noise ratio (SNR) with which the source 
could be detected using matched-filtering techniques. 

In addition to their astrophysical applications, sources in nearly all classes are prospective lab- 
oratories for testing gravitation. The different source classes include binary systems of different 
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characters, each of which has distinct possibilities for testing the precepts of GR. Massive black- 
hole (MBH) binaries are strong-field systems that yield GW signals with large SNRs, making signal 
detection and characterization less ambiguous than for weaker sources. This will allow detailed 
exploration of waveform deviations from the predictions of GR. Extreme-mass-ratio inspirals (EM- 
RIs) resulting from the capture of stellar-mass objects by MBHs provide detailed probes of MBH 
spacetimes, thanks to the large number of strong-field waveform cycles that will be observable from 
these systems. Individually resolvable compact Galactic binaries (which have stellar masses) offer 
excellent laboratories for testing GR, because their waveforms and evolution are well understood 
and easily described using model templates. In addition, many such binaries may be detectable 
with both GW and EM observatories, providing opportunities to test the propagation of GWs. 

Many approaches to constraining alternative theories are proposed as null experiments, where 
the assumption is that GW observations will validate the predictions of GR to the level of the 
detector's instrumental noise. The size of any deviations from GR are then constrained by the size 
of the errors in the GW measurement. However, in order to perform these null experiments, it is 
necessary to have accurate models for the GWs predicted in GR. The appropriate modeling scheme 
depends on the system under consideration. For weak-field sources such as compact binaries, the 
waveforms are well described by the quadrupole-moment formula [281)1 1287] . For MBH binaries, 
the early inspiral takes place in a regime of moderate field strengths, which can be well described 
by post-Newtonian expansions in the orbital velocity of the binary [238, 389, 218, 390 . Toward 
the end of the evolution, however, the merger phase can only be tackled with the full numerical- 
relativity solution of the Einstein field equations |296| Ff8 l 140" ! 1318] . For EMRIs, the smaller object 
spends many thousands of orbits in a regime where the velocity is a significant fraction of the speed 
of light, so neither post-Newtonian equations nor numerical relativity apply. However, black-hole 
perturbation theory, in which the mass ratio of the objects is used as an expansion parameter, may 
be used to generate waveforms [292] . 

Additional modeling complications arise from astrophysical phenomena, since tidal coupling, 
extended body effects, and mass transfer can all leave an impact on source evolution, complicating 
the interpretation of the observed gravitational waveforms. The very capabilities of space-based de- 
tectors also introduce complications: in contrast to the sporadic appearance of GWs in the output 
of ground-based interferometers, the data from space-based missions will contain a superposition 
of millions of individual continuous signals. We expect that we will be able to resolve individually 
thousands of the strongest sources. Thus, we must deal with problems of confusion (where a mul- 
titude of signals sums up to form an unresolved stochastic foreground), subtraction (where strong 
signals must be carefully modeled and removed from the data before weaker sources, overlapping 
in frequency, become visible), and global fit (where the parameters of overlapping sources have cor- 
related errors that must be varied together in searches and parameter-estimation studies). These 
issues have not escaped the attention of LISA scientists, and have been tackled both theoretically 
|279j , and in a practical program of mock data challenges [3U [33 1364] . All these issues in data 
analysis and modeling will impact the prospects for tests of GR using the observations. Little work 
has been done on this topic to date, but we will discuss relevant studies where appropriate. 

In the remainder of this section we will briefly describe the three principal source classes ex- 
pected in the mHz band. For each source type, we will discuss the astrophysics of the systems, 
the estimated event rates for (e)LISA observations, the potential astrophysical implications of the 
observations, and the applications of the observations to tests of GR, with pointers to later sections 
that describe the tests in more detail. 

4.1 Massive— black-hole coalescences 

Most (if not all) galactic nuclei come to harbor a massive black hole (MBH) during their evolu- 
tion [244, 305J, and individual galaxies are expected to undergo multiple mergers over their lifetime. 
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It follows that the formation of MBH binaries following galaxy mergers is an expected outcome. 
The mergers of such binaries will be among the strongest sources of low- frequency GWs. The 
rate at which MBH binaries merge in the Universe is uncertain at best, but these events will be 
detectable by LISA-like detectors to extremely large distances, probing an enormous volume of the 
visible Universe. The detection of any MBH mergers, even at a low rate, would produce interesting 
astrophysical results. 

For a circular binary, the mass of a system that merges at a frequency / m is roughly 



the frequency range accessible to space-based GW detectors extends from a few 10 4 Hz to a few 
1CP 1 Hz, which sets the sensitive mass range to ~ 10 4 -10 7 M Q . 

MBH mergers as low-frequency GW sources. Predictions for the observable population of 
black-hole mergers are based on merger-tree structure formation models. The overall merger rate 
depends on the detailed mechanisms of evolutionary growth of the massive black-hole population. 
The energy budgets of active galactic nuclei suggest that MBHs could grow by efficient accretion 
processes |394j . while other considerations suggest that mergers could contribute significantly to 
the early growth of black holes |206j . Studies of early cosmic structure [245] indicate that the 
massive black holes form from the coalescence of many smaller seed black holes. Many models 
based on this idea have been developed and numerically simulated [1771 13751 13511 15"2"] . The number 
of events detectable by LISA was estimated to be in the range 3-300 per year |326j , with a spectrum 
of masses in the range 10 3 -10 7 M Q . The predicted event rate is not much different for eLISA |22) . 
although some of the marginally detectable events involving lighter black holes at high redshift 
will no longer be observable. 

Figure [2] shows contours of constant SNR (as seen by eLISA) in the redshift-total mass plane 
for equal-mass, nonspinning MBH mergers (left panel) and in the total mass-mass ratio plane for 
MBH mergers at a fixed redshift of z — 4 (right panel). For typical events at z ~ 4, the SNRs of 
eLISA observations are expected to exceed 100 for total masses ~ 5 x 1O 5 M . For comparable-mass 
mergers at lower redshifts, eLISA observations could exceed SNRs of 1000. SNRs for LISA are 
typically factors of 2-3 higher. For total mass > 1O 5 M0, a significant fraction of this SNR comes 
from the final merger and ringdown phase. 

The practical detection of MBH systems will have to rely on careful data analysis. Several de- 
tection algorithms for MBH binaries have been studied by a number of different groups, encouraged 
in part by the Mock LISA Data Challenges (MLDCs) These include Markov Chain Monte 

Carlo techniques [107) . time-frequency analysis [75], particle-swarm optimization [162) . nested 
sampling [138j . and genetic algorithms [288] . The most recent data challenge included multiple 
spinning MBH binary signals in the same dataset, and multiple groups demonstrated their ability 
to recover these binaries from the LISA data stream [36] , 

Astrophysics using MBH mergers. GW observations will directly probe the character of the 
black holes on scales comparable to the size of the event horizons. How the information about the 
black holes (such as their spin and mass) is encoded in the waveform is a topic of much focused 
research (see, for example, [2271 1328 for reviews). 

The precision with which GW observations will be able to extract the 17 parameters character- 
izing a MBH binary system has been the subject of extensive investigation. The first comprehensive 
study [110] considered the LISA determination of the sky location, luminosity distance, and mass 
of MBH systems, by including a simple model of LISA's orbital motion around the Sun. Later 
studies [266] considered the angular resolution of detectors in precessing-plane configurations like 
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Figure 2: Contours of constant SNR for MBH binaries observed with eLISA. The left-hand panel 
shows contours in the total mass-redshift plane for equal-mass binaries, while the right-hand panel 
shows contours in the total mass-mass ratio plane for sources at redshift z — 4. Reprinted from 
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LISA's, as well as ecliptic-plane interferometers that lie flat in the ecliptic plane as they orbit the 
Sun. More recent studies have extended MBH-binary waveforms with spin effects, higher har- 
monics, and with merger and ringdown signals, and have looked at the effect of these corrections 
on LISA's parameter estimation ability |369l 12951 [2"§1 12211 1254) . It is expected that observations 
with a LISA-like detector will be able to determine the masses of the two binary components to 
0.01-0.1%, the spins to 0.001-0.1, the sky position of the binary to 1-10 deg 2 and the luminosity 
distance to 0.1-10%. For eLISA, the corresponding results are 0.1-1%, 1-10%, 10-100 deg 2 and 
10-100%. Such precise measurements will provide unprecedented information about black holes in 
the Universe. 

These parameter-estimation studies witness the quality of the information that will be available 
to astrophysicists for the phenomenological exploration of the astrophysical character of black-hole 
systems. Of particular interest is the evolution of black-hole spins and the final spin of merger 
remnants [201] . which GW observations should be able to shed some light on. In addition, the 
set of detected MBH merger systems will provide constraints on the formation history of MBHs 
|21M [HI HM I2SS]. In Refs. [3251 [TM] it was shown that a detector like LISA would, with high 
confidence, be able to tell the difference between ten different models for the growth of structure 
in the Universe. For many of the models, the difference would already be apparent after as 
little as three months of data collection. A similar model differentiation would also be possible 
with eLISA j^T] and both detectors will also be able to constrain "mixed models" combining the 
different black hole population models. MBH binaries can also be used as standard candles to 
probe cosmological evolution [195] . 



Testing relativity using MBH mergers. MBH mergers are good laboratories for testing rel- 
ativity because of the high SNR expected for the signals and the correspondingly large volume 
within which they can be observed. This makes them good systems for constraining GW polar- 
izations (see Sec. 5.1.1) since relatively weak alternative-polarization signals could in principle be 
detected. They are also excellent systems for testing GW propagation effects, such as subliminal 
propagation speeds and the presence of parity violations, since these effects accumulate with dis- 
tance and MBH binaries can be seen to very high redshifts (see Sec. 



5.1.2). MBH binaries are 
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also well suited for generic tests of relativity based on measuring the evolution of inspiral phasing 
and checking it for consistency with GR, since the high SNR allows a large number of phasing 
parameters to be measured (see Sec. 5.2). 

MBH binaries are also the only systems for which the quasinormal ringdown radiation, gen- 
erated as the highly distorted post-merger black hole settles down into a quiescent state, will be 
detectable. Although the amount of energy deposited into different ringdown modes is not well 
understood, the frequencies and decay rates of the ringdown modes are predicted precisely in GR 
from perturbation theory. The mass and spin of the black hole can be measured from a single 
ringdown mode; if multiple modes arc detected, the system can be used to check for consistency 
with the Kerr metric, and therefore provides a probe of black-hole structure. For a comprehensive 
review of black-hole ringdowns, we refer the reader to |223j . A more detailed discussion of the use 
of ringdown radiation to probe black hole structure may be found in Sec. |6.3| 

Last, the space-based GW observation of MBH binaries offers an unprecedented opportunity to 
probe the fully dynamical strong-field regime of GR by comparison of the observed merger signals 
to the predictions of numerical relativity. This is an area that has not yet been explored in any 
detail and so we will not discuss it further in this review. However, it is an important subject that 
should be carefully explored before a space-based GW detector finally becomes a reality. 



4.2 Extreme— mass-ratio inspirals 

An extreme-mass-ratio inspiral (EMRI) is the GW-emitting inspiral of a stellar-mass compact 
object, either a black hole, neutron star or white dwarf, into a MBH in the center of a galaxy. 
A main-sequence star with mean density p = lOg cm" 3 will be tidally disrupted by a black hole 
of mass 10 6 M Q at a distance of ~ 25 Schwarzschild radii (151] . At such separations an extra- 
galactic EMRI system will not be generating detectable amounts of gravitational radiation in the 
low-frequency band. The tidal disruption radius increases with decreasing stellar density and 
decreasing central black hole mass and the reference values used above are at the upper end of 
suitable values for main-sequence stars and milli-Hertz GW sources. It is therefore not expected 
that the inspiral of a main-sequence star will be a candidate for an EMRI, with the possible 
exception of such sources in the Galactic center, which is sufficiently nearby that radiation from 
an object orbiting at several tens of Schwarzschild radii might be detectable [151] . 

EMRIs occur in the dense stellar clusters that are found surrounding the black holes in the 
cores of galaxies, and are triggered by a variety of processes. In the "standard" scenario, two-body 
encounters in the stellar cluster gradually perturb the compact-object orbits, changing their angular 
momentum. This can put a compact object onto an orbit that passes very close to the central 
black hole. The orbit loses energy and angular momentum in bursts of gravitational radiation 
emitted near periapse, and if sufficient energy is radiated, the object can be left on an orbit that 
is bound to the central black hole. It will then gradually inspiral into the central black hole as it 
loses orbital energy and angular momentum to GW emission. Alternative mechanisms include the 
tidal splitting of binaries 259 , the tidal stripping of giant stars )114| I119j , and star formation in 
a disc surrounding the central black hole. More details can be found in [2~4l IT9"] . 



EMRIs as low-frequency GW sources. The intrinsic rate at which EMRIs occur depends 
on a lot of astrophysics that is rather poorly understood. The rate is strongly influenced by mass 
segregation [2~4l 11521 1260[ . triaxiality in galactic nuclei [194. 294J, resonant relaxation [303, 304J, and 
the "Schwarzschild barrier" |257j . A full description of these processes and their relative importance 
can be found in [24|. [19] . The range of EMRI rates per galaxy reported in the literature is ~ 1- 
1000 Gyr -1 for black holes, and ~ 10-5000 Gyr -1 for white dwarfs, with most recent best-guess 
estimates of ~ 400 Gyr -1 /20 Gyr _1 /7 Gyr -1 for black holes/white dwarfs/neutron stars [1961125] . 
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Uncertainties in the event rate come not only from the intrinsic rate of capture, but also from 
uncertainties in the number density of black holes of suitable mass. At present, only three objects 
in the range 1O 4 -1O 7 M0 are known from kinematic measurements [139] . Using galaxy luminosity 
functions, the luminosity-velocity dispersion relation and the black-hole mass-velocity dispersion 
relation, one infers the number density of MBHs in the LISA mass range to be roughly constant 
per logarithmic mass unit, and equal to a few 10~ 2 Mpc~ 3 . However, if black-hole merger kicks 
are significant, the number of black holes with M < 2 x 10 6 M Q may be significantly depleted |116| . 

EMRI waveforms are complex, since the orbits are expected to be both eccentric and inclined 
with respect to the equatorial plane of the central black hole. Various algorithms have been 
suggested for EMRI detection, including hierarchical matched filtering starting with short data 
segments |160j , time-frequency analysis |379| I155| 1154] , and Markov Chain Monte Carlo |344[ I38] 
1105] . Indeed, EMRIs have proven to be the most difficult source to detect in the MLDC datasets, 
but in the third challenge round two groups correctly identified the three EMRIs with the highest- 
mass central black holes [37] ■ Several challenging aspects remain unsolved, such as dealing with 
source confusion and detecting the EMRIs with lower-mass central black holes, which are expected 
to dominate the data; recent progress encourages confidence that these problems will be solved by 
the time a space-based detector is launched. 

Assuming that a matched-filtering SNR of 30 is required for EMRI detection, and using the 
previously quoted EMRI rates per black hole |196j . it was estimated that LISA could see several 
hundred events |157| out to a redshift z < 1. The intrinsic rate uncertainties mean that the 
actual number could be anywhere from a few tens to a few thousand. Most of the events will 
be black-hole inspirals into MBHs of mass 10 5 -10 6 M Q . The estimate for eLISA is a few tens of 
events [1631 |2"21 121j . although this is based on a lower SNR detection threshold of 20. (The lower 
threshold is justified by the results of the MLDCs, which successfully demonstrated detection of 
EMRI events with SNR of ~ 15, albeit with less than realistic signal confusion.) 

Astrophysics using EMRIs. EMRI observations will tell us about the properties of MBHs 
at low redshift and about the physics of nuclear stellar clusters. Each EMRI observation can 
determine many of the parameters of the system, including the mass and spin of the central black 
hole, to accuracies of a fraction of a percent [HI 1199] . With as few as ten EMRI observations, 
LISA would be able to constrain the slope of the black hole mass function at low redshift to 
~ ±0.3, which is the level of the current observational uncertainty |165j . eLISA would have the 
same capability given the same number of detections. If LISA observed as many EMRIs as current 
estimates predict, the constraints on the slope would improve by an order of magnitude, although 
the corresponding improvement for eLISA would be only a factor of two. EMRIs will also provide 
information on the distribution of black hole spins. The masses of the compact objects observed 
in EMRIs will provide information about mass segregation processes, while the overall number of 
events encodes information about the complicated physics that determines the event rate. Finally, 
the eccentricities and inclinations of the observed EMRIs will provide information on the relative 
efficiencies of the various channels through which EMRIs could be formed [MJ [TH] ■ 

Testing relativity using EMRIs. EMRIs are well suited for testing relativity for several rea- 
sons: the systems are expected to be clean since the small object is likely to be a black hole with 
no internal structure, and the influence of any gaseous material in the spacetime is expected to 
be negligible (but see Sec. pi); the waveforms can be computed very accurately using black hole 



perturbation theory |292[ [44] . allowing very small deviations from the model to be identified; the 
inspiral is slow, meaning that many waveform cycles are generated in the strong-field regime, and 
can be used to test the spacetime structure [143] : and the waveform structure is rich, due to the 
expected eccentricity and inclination of the orbits [MJ I121j . 
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The primary fundamental-physics application of EMRI observations is probing black-hole struc- 
ture, since the EMRI waveforms encode a map of the spacetime geometry close to the central black 
hole. This can be used to test whether the central object is indeed described by the Kerr metric, 
as we expect, or is some other kind of object. A discrepancy could indicate an astrophysical per- 
turbation in the system, a violation of the Cosmic-Censorship hypothesis [285] or of the energy 
conditions of GR, or even evidence for the existence of an exotic massive compact object such as 
a boson star. All of this will be discussed in detail in Sec. |6] EMRIs can also be used for tests of 
GW polarization (Sec. 5.1.1), and to constrain alternative theories of gravity through detection of 
modifications to the inspiral rate (see Sec. 5.1.3). 



4.3 Galactic binaries 

The most numerous source class in the low-frequency GW band are the ultra-compact binaries 
of two stellar-mass degenerate bodies: white dwarfs, neutron stars or black holes. Most of these 
will be in the Milky Way. Numerical population synthesis |272l I270| and theoretical studies of the 
highly-evolved stellar population in the Galaxy [204, 205 predict that there will be ~ 10 7 ultra- 
compact binaries generating GWs in the mHz band visible to space-based detectors like LISA or 
eLISA. 



Ultra-compact binaries as low-frequency GW sources. For an instrument with sensitivity 
similar to the "classic" LISA design, the complete population of compact galactic binaries would 
separate into two distinct observational classes: confused binaries and resolvable binaries. Confused 
binaries are those with orbital frequencies that are close enough to other binaries in the population 
that they cannot be readily distinguished from one another in the data [MJ 11911 1106] ; their signals 
are overlapping and cannot be disentangled from one another, unless one source happens to be 
much closer and stronger than the others. For LISA, the confused binaries make up a stochastic 
foreground signal from the Galaxy, which would be present in the data for all frequencies below 
the confusion limit f c ~ 3 mHz. Various attempts have been made to estimate the spectrum 
of this foreground [2711 13541 11291 11281 1310] . but the final profile would also depend on details 
of the algorithms used to analyze the LISA data |229L I106L 13461 1109] . For an instrument with a 
sensitivity comparable to eLISA, which is approximately a factor of two worse in the relevant range, 
this confusion foreground would be much less prominent, and perhaps not detectable [221 121] . 

Resolvable binaries are found both above and below the f c confusion limit. Sources below f c 
are binaries that happen to be nearby, and so dominate the emission near their central frequency. 
For sources above f c , the characteristic orbital frequencies are separated by a few frequency bin 
widths A/ ~ 1/T f, s , where T Q b s is the length of observation. An instrument like LISA would 
be expected to resolve as many as 18000 individual binaries [271] . while for eLISA the number is 
expected to be a few thousand [221 HI] . 

Among the resolvable binaries there are several verification binaries (see [345] ; for a current list 
of the known verification binaries, see |273| ). These are binaries that are expected to be resolvable 
in the LISA-eLISA data, but have already been detected and characterized using electromagnetic 
telescopes. These known binaries will provide early verification that the mission is operating 
correctly at the nominal sensitivity. 



Astrophysics using ultra-compact binaries. The large number of compact binaries de- 
tectable throughout the entire Galaxy makes these sources an exceptional resource to address 
questions about the stellar evolutionary history of the Galaxy and the astrophysical dynamics of 
strong-field interactions in compact systems. If the confusion foreground is observed, it will be 
modulated by detector motion, a fact that can be used to estimate the multipole moments of the 
spatial distribution of binaries in the Galaxy |3271 1130] . A detector like LISA might also be able to 
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detect the extra-Galactic population of ultra-compact binaries, providing a valuable probe of its 
characteristics |137l 11011 1103] . However, an extra-Galactic population is unlikely to be observed 
by eLISA. 

Detections of Galactic binaries will be dominated by double-white-dwarf binaries, which are 
scarce in the galaxy, so (e)LISA will be in a unique position to illuminate their physics. This will 
shed light on their formation and evolution, including the poorly understood phase of common- 
envelope evolution. (e)LISA is also expected to observe interacting binaries where mass transfer 
is ongoing. GW observations can shed light on the fraction of mass-transferring systems; these 
systems may allow coincident electromagnetic observations, making them even more precise probes 
of astrophysical processes such as mass transfer and tidal coupling 323, 387, 324, 386 . 

Testing relativity using ultra-compact binaries. Resolvable compact binaries offer a good 
laboratory for testing GR because their waveforms and evolution are well understood and easily 
described using model templates, and because a reasonable fraction of them will be visible both 
to (e)LISA and to electromagnetic telescopes. 

Ultra-compact binaries will provide constraints on massive-graviton theories, either by compar- 
ison of GW propagation delays to the observable electromagnetic signals from the binary, or by 
the dispersion of the gravitational waveform as the binary slowly evolves. This is discussed in Sec. 
|5.1.2| The ultra-compact binaries also provide laboratories for testing waveform evolution, and 
for GW polarization and dispersion studies, precisely because they evolve slowly and are easily 
described by post-Newtonian analysis. This is discussed in Sec. |5.1.1| and Sec. |5.2| 
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5 Gravitational-wave tests of gravitational physics 



Almost since its inception, GR was understood to possess propagating, undulatory solutions — 
GWs, described at leading order by the celebrated quadrupole formula [213 . It took several 
decades to establish firmly that these waves were real physical phenomena and not merely artifacts 
of gauge freedom. 

How would GW observations test the GR description of strong gravitational interactions, and 
possibly distinguish between GR and alternative theories? To answer this question we need to 
take a quick detour through GW data analysis. At least for foreseeable detectors, individual GW 
signals will typically be immersed in overwhelming noise, and therefore must be dug out with 
techniques akin to matched filtering [209 , which by definition can only recover signals of shapes 
known in advance (the templates), or very similar signals. A matched- filtering search is set up 
by first selecting a parametrized template family (where the parameters are the source properties 
relevant to GW emission), and then filtering the detector data through discrete samplings of the 
family that cover the expected ranges of source parameters. The best-fitting templates correspond 
to the most likely parameter values, and by studying the quality of fits across parameter space it 
is possible to derive posterior probability densities for the parameters. 

After a detection, the first-order question that we may ask is whether the best-fitting GR 
template is a satisfactory explanation for the measured data, or whether a large residual is left 
that cannot be explained as instrument noise, at least within our understanding of noise statistics 
and systematics. (Slightly more involved tests are also possible: for instance, we may divide 
measured signals in sections, estimate source parameters separately for each, and verify that they 
agree.) If a large residual is found, many hypotheses would be a priori more likely than a violation 
of GR: the fitting algorithm may have failed; another GW signal, possibly of unexpected origin, 
may be present in the data; the data may reflect a rare or poorly understood instrumental glitch; 
the GW source may be subject to astrophysical effects from nearby astrophysical objects, or even 
from intervening gravitational lenses. 

Having ruled out such non-fundamental explanations, the only way to quantify the evidence for 
or against GR is to consider it alongside a set of alternative theories of gravity. If the alternative 
theories under consideration include one or more adjustable parameters that connect them to GR 
(such as wwd for Brans-Dicke theory, see Sec. 2.1), and if those parameters can be propagated 
through the mathematics of source modeling and GW generation, then GR template families can 
be enlarged to include them, and the extra parameters can be estimated from GW observations. 
These extra parameters may have a more phenomenological character, as would for instance a 
putative graviton mass that would affect GW propagation, without finding direct justification in a 
specific theory. Indeed, many of the "classic tests" discussed below (Sec. 5.1) fall within this class. 
To test GR against "unconnected" theories without adjustable parameters, we would instead filter 
the data through separate GR and alternative-theory template families, and use Bayesian model 
selection [193] to decide which model and theory are favored by the data. 

In the absence of well-defined foils, it may be desirable to proceed along the lines of the PPN 
formalism (Sec. 2.1) and immerse the GR predictions in expanded waveform families, designed 
to isolate differences in the resulting GW phenomenology (Sec. 5.2). Proposals to do so include 
schemes where the waveform-phasing post-Newtonian coefficients, which are normally deterministic 
functions of functions of a smaller number of source parameters, are estimated individually from 
the data [3DH3T]; the ambitiously named parametrized post-Einstein (ppE) framework 401 ; and 
the parametrization of Feynman diagrams for nonlinear graviton interactions |82j . In Sec. 5.3 we 



discuss ideas (so far rather sparse) to use the GWs from binary mergers-ringdowns to test GR. 

We close this introductory comments by discussing two methodological caveats. First, GW 
observations are often characterized as "clean" tests of gravitational physics — whereby the "clean" 
emission of GWs from the bulk motion of matter (already emphasized above) is contrasted to 
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"dirty" processes such as mass transfer, dynamical equation-of-state effects, magnetic fields, and 
so on. An even stronger notion of cleanness is important for the purpose of testing GR: for the best 
sources, the waveform signatures of alternative theories cannot be reproduced by changing the as- 
trophysical parameters of the system — this orthogonality is quantified by the fitting factor between 
the GR and alternative-theory waveform families [365] . (The degeneracy of the alternative-theory 
and source parameters would also lead to a "fundamental bias" in the estimation of astrophysical 
quantities [401) . ) 

Second, many of the results presented in this section rely on the Fisher-matrix formalism for 
evaluating the expected parameter-estimation accuracy of GW observations. As emphasized by one 
of us |363j , the Fisher matrix is built with the first derivatives of waveforms with respect to source 
parameters, and therefore can only "know" about the neighborhood of the true source parameters 
where the waveform can be construed as linear with respect to the parameters. If the estimated 
errors take the waveform outside that neighborhood, then the formalism is simply inconsistent and 
unreliable. Higher SNRs reduce expected errors and therefore would generally make the formalism 
"safer," but there is no absolutely safe SNR, depending on the number of parameters that need to 
be estimated, on their inter-entanglement, and on the strength of their effects on the waveforms. 

5.1 The "classic tests" of general relativity with gravitational waves 

As Will points out |3851 ch. 10], virtually any Lorentz-invariant metric theory of gravity must 
predict gravitational radiation, but alternative theories will differ in its properties. Will identifies 
three main properties that can be measured with GW detectors. These are the polarization, speed, 
and (emission) multipolarity of GR. In this paper, we broaden the scope of the third to include 
changes to the loss of energy to GWs in inspiraling systems. 

In analogy to the three classic tests of GR (the perihelion of Mercury, deflection of light, and 
gravitational redshift) we like to refer to the verification that these properties have the predicted 
GR values, rather than the values predicted by alternative theories, as the "classic tests" of GR 
using GWs. Just as PPN tests probe weak-field, slow-motion dynamics, these tests can be seen 
as probing the weak-field far zone where waves have propagated far from their sources. However, 
the multipolarity of GWs at emission and the energy that they carry away can be influenced by 
strong-field properties in the near zone where waves are generated. 

5.1.1 Tests of gravitational-wave polarization 

GR predicts the existence of two transverse quadrupolar polarization modes for GWs (also de- 
scribed as "spin-2" and "tensor" using the language of group theory), usually labeled h + and h x . 
Alternative metric theories of gravity predict as many as six polarizations [385 (three transverse 
and three longitudinal), corresponding to the independent electric-type components of the Rie- 
mann curvature tensor, i?oiOj- Schematically, these components are measured by GW detectors by 
monitoring the geodesic deviation of nearby reference masses. The effect of different polarization 
modes is best illustrated by the induced motion of a ring of test particles, as in Fig. [3} The response 
of a standard right-angle interferometer to a scalar wave is maximal when the wave propagates 
along one arm; by contrast, tensor modes elicit maximal response when the wave propagates in a 
direction perpendicular to the plane of the detector. 

Direct detection. The use of GW polarization modes to test GR was first proposed in 1973 
|1271 [126 . The sensitivity of resonant and interferometric detectors, as well as Doppler-tracking 
and pulsar-timing measurements, to the extra modes was considered in several studies 282, 189, 
13291 13761 12401 12461 12681 12331 [2741 [L8\ H21 ESI HHSJ- In the most general setting, the problem of 
disentangling the modes has eight unknowns - the time series for the six polarizations, plus two 
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Figure 3: Effect of the six possible GW polarization modes on a ring of test particles. The GW 
propagates in the z direction for the upper three transverse modes, and in the x direction for the 
lower three longitudinal modes. Only modes (a) and (b) are possible in GR. (Illustration from 
Will |352] .1 



direction angles that affect the projection of the modes on the detector - but only six observables, 
corresponding to the RoMj components. Thus the problem is indeterminate, unless the source 
position is known from other observations (such as time-of-flight delays for a long-baseline network 
of detectors), or unless we restrict GWs to transverse modes on theoretical grounds. 

Space-based observatories similar to LISA have either a single independent interferometric 
observable or three (if laser links are maintained across all three arms). Each observable measures 
a different admixture of polarizations. Thus, a detector with three active arms could in principle 
discriminate a non-GR polarization mode if the direction to the source is known, or if it can be 
determined from the measured signal (by means of the modulations produced by orbital motion, 
or by triangulation between the signals measured at the three spacecraft). 

The LISA sensitivity to alternative polarization modes was assessed in |355j , using the full TDI 
response (see Sec. 3.1 1. At frequencies larger than the inverse light-travel time along the arms, 
LISA would be ten times more sensitive to scalar-longitudinal and vector modes [(d) to (f) in 
Fig. [3] than to tensor and scalar-transverse modes [(a) to (c) in Fig. [3]. At lower frequencies, the 
sensitivity to all modes is approximately the same. These results have not yet been used to work 
out the constraints that LISA could place on specific alternative theories using different types of 
sources. 



Solar oscillations. Finn |141j observed that Solar oscillations with 5-10-min periods produce 
gravitational strains ~ 10 -26 at Earth, possibly within reach of space-based detectors. The de- 
tectors would measure the Sun's dynamical gravitational field in the transition region where it is 
turning into radiation. Finn showed that the field develops a significant phase shift relative to 
Solar oscillations, which depends on the GW polarizations, and which could distinguish between 
scalar, vector, tensor, and scalar-tensor theories of gravity. The limit placed by such observations 
on the Brans-Dicke parameter would be weaker than current bounds from Solar-system tests; on 
the other hand, measuring incipient GWs in the transition zone makes this a novel and possibly 
unique test. We note however that Finn's early exploration [141] predates our full understanding 
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of the design and parameters of LISA-like missions, which are likely to be less sensitive to this 
signal. 



Galactic binaries. Among the compact Galactic binaries that would be detected by a LISA-like 
detector, several have orbital inclinations known from optical observations. For these systems we 
can compute the specific linear combination(s) of polarizations that would be appear in the data, 
which can then be checked for consistency. A single inconsistent binary may indicate an error in 
the determination of inclination or distance, but systematically inconsistent sources would hint at 
large non-tensor GW components. However, from general arguments the measurement accuracy 
for polarization amplitudes is ~ 1/SNR (with SNR a few tens at most for Galactic binaries), so 
only very large corrections to GR would be detectable in this way. 



5.1.2 Tests of gravitational-wave propagation 

In GR, gravitational radiation propagates at the speed of light: v g = c. The experimental validation 
of this prediction can be posed as a bound on the graviton mass m g , which is exactly zero in GR 
(see [5TJ 1172) for a broader context). However, it may be advisable to consider m g as a purely 
phenomenological parameter, since massive-graviton theories do not recover GR predictions such 
as light bending [3B51 [407| l39| . 

Weak-field measurements in the Solar system already provide bounds on m g on the basis of the 
massive-graviton Yukawa correction to the Newtonian potential: 

V(r) = --exp(-r/\ g ), (21) 
r 

where X g = h/m g is the Compton wavelength of the graviton. The corresponding GW propagation 
speed would be given by 

ft)'- 1 " 

with f g the frequency of radiation. The best Solar-system tests provide the bound A g > 2.8 x 10 12 
km (or m g < 4.4 x 1CP 22 eV) 35(J|. By contrast, binary-pulsar dynamics only provide a bound 
m g < 7.6 x 10 -20 eV |142j . As we discuss in this section, observations of binary GWs with LISA-like 
detectors could provide bounds competitive with these results, with the advantage of examining 
a rather different sector of gravitational physics, wave propagation. Two distinct methods have 
been proposed for this. 



Comparing the phase of GW and EM signals. This technique offers a direct comparison of 
the speed of GWs with the speed of a radiation assumed to be null (light itself) . For the technique 
to work, sources must be observable in both light and GWs, and the astrophysical delays (if any) 
between the two signals must be well understood and modeled. The most prominent low-frequency 
sources for this purpose are compact Galactic binaries. 

Let the difference between the arrival times of GWs and EM signals be 

At = At a - (1 + z)At e , (23) 

where At a arises from propagation (the very effect we wish to measure), and At e from different 
emission mechanisms or geometries. Here z ~ DlHq/c is the redshift of the source, with Dl the 
luminosity distance and H the value of the Hubble parameter. In terms of v g , 

e<sl -* = lxl „-..(i£)(£). < 24 > 
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which we relate to m g using the total relativistic energy, 

The measurement of At has been considered repeatedly in the literature |230L lllll 1102] . The 
main difficulty lies with modeling the emission delay At e : consider for instance AM CVn binaries, 
where a low-mass helium donor has expanded to fill its Roche lobe and is spilling mass onto a 
white-dwarf primary. The EM signal from these systems is affected greatly by the light emitted 
from the overflow stream impacting the accretion disk, and the light curve oscillates as the system 
orbits, alternately flashing the impact point toward and away from the observer. The times of 
maximum emission can be taken as reference for the EM phase, but how are they related to GW 
emission? 

To evaluate this At e , one may observe the compact binary at two epochs, ideally at opposite 
points across the Earth's orbit [230, f f f . Under the assumption that At e is constant, differencing 
the total At measured at the two epochs leaves a measure of At a alone. However, the subtraction 
reduces At a to what can be accumulated across the diameter of the Earth's orbit, rather than 
across the entire distance to the binary. As a consequence, the strongest bound from known LISA 
verification binary would be A g > 3 x 10 13 km (m g < 4.6 x 1CT 23 eV). 

Alternatively, one may concentrate on eclipsing compact binaries, where the light curve varies 
due to the mutual eclipses of the binary components, allowing the orientation geometry of the 
system to be precisely determined as a function of time, and yielding an accurate measure of At e . 
In this case the measured At a is accumulated over the entire Earth-pulsar distance. Only one 
eclipsing binary that would be observable with LISA-like detectors is currently known 190 , but 
an analysis of their statistically expected population suggests that LISA would obtain a bound 
A g > 2 x I0 14 km (m g < 6 x 10~ 24 eV) [TP2] . 

The reader may question whether it is appropriate to compare gravitons to photons, when the 
current bound on the putative mass of the photon is as high as m 7 < 2 x 10~ 16 eV. However, 
the much higher frequency of optical photons compared to low-frequency gravitons leads to e 7 < 
3 x 10 -33 , much smaller than e g < 10~ 8 (for Solar system tests) [230 , so a comparison based on 
speeds is indeed appropriate. 

A related test using pulsar-timing observations would compare the GW-induced phase delays 
accumulated by photons traveling to Earth from different pulsars [232 : the delays depend on the 
graviton speed through a geometric factor that alters the standard Hellings-Downs correlation 
function between the delays. 



Measuring the dispersion of gravitational-wave chirps. The chirping signals emitted by 
inspiraling binaries contain a range of frequency components: if the graviton has mass, the com- 
ponents propagate at different speeds, again given by Eq. (25). This effect can be modeled in the 
templates used to search for binary signals by including a A g dependence in the waveform phasing 
[381J. In the frequency-domain representation, the propagation effect appears as a "dephasing" 
term — ^(7r7W c /) _1 , where j3 = tt 2 DM c / 'A 2 ,/ '(1 + z) with M c the binary chirp mass, / the GW 
frequency, D the source distance, and z the source redshift. By comparison, the leading-order 
term in the post-Newtonian expansion of the phasing is 3/128(7r.M c /) _1 , while the X g correction 
contributes the same power of orbital frequency as the "1PN" term. 

For space-based detectors, the best chirp-dispersion bounds will come from massive-black- 
hole systems; they improve slightly with the total binary mass and with better low-frequency 
(10^ 5 -10 -4 Hz) sensitivity. However, the expected bounds depend strongly on which other physi- 
cal effects (such as spin-induced precessions, orbital eccentricity, higher waveform harmonics, the 
merger-ringdown phase) will be relevant in the detected systems. As a result, a variety of predic- 
tions have appeared in the literature [3841 1551 1591 l3"2l I339L 13911 13401 12141 1203] . Bounds as strong as 
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Xg > a few 10 16 km seem possible, and would be strengthened by analyzing full catalogs of binary 
detections at once |61j . 

Instead of the chirping signals from inspiraling binaries, Jones [211] proposes a test of the GW 
dispersion relation using the waves from eccentric Galactic binaries, which are emitted at multiple 
harmonics of the orbital frequencies; if at least one Galactic binary has sufficient eccentricity, Jones 
claims sensitivity comparable to the chirp-dcphasing measurements. Mirshckari and colleagues 
[26 lj extend the graviton-mass formalism to more general modified- gravity theories that predict 
violations of Lorentz invariance and modified dispersion relations for GW modes, given by 

E 2 = p 2 c 2 + m 2 g c A + Ap a c a ; (26) 

both nig and A can be constrained together, given the a corresponding to specific theories, by 
inspiral-binary observations with ground- and space-based detectors. 

Parity violations. In GR parity is a conserved quantity, so left- and right-circular polarized 
gravitational radiation propagates alike. Many attempts to formulate a quantum theory of gravity 
require the addition of a parity-violating Chern-Simons (CS) term to the Einstein Hilbcrt action 
[171113112531: 

Scs = ^JeR*Rd 4 x, where R* R = ^R a0yS e a ^ u ; (27) 

here R a /3jS is the Riemann tensor, is the Levi-Civita tensor density, and 9 is a (possi- 

bly) position-dependent function that describes the coupling of the CS field to spacetime. This 
correction creates a difference in the propagation equations for the left- and right-circular GW 
polarizations, resulting in their amplitude birefringence: one circularly polarized state is amplified 
through propagation, while the other is attenuated. 

This effect is potentially observable with LISA-like detectors for MBH-binary inspirals at cos- 
mological distances |H] (see also [395] ). where the amplitude birefringence generates an apparent 
precession of the orbital plane of the binary. The CS correction accumulates with distance, and 
is larger for sources at higher redshifts. Orbital-plane precession will also arise from general- 
relativistic spin-orbit coupling, but the scaling of the precession with frequency is different, so the 
two effects can be distinguished, at least in principle. 

For an equal-mass binary with redshifted masses of 1O 6 M0 that is observed plane-on at a 
redshift z = 15, LISA could constrain the integrated CS contribution at the level of 10 -19 [5]. 
This is several orders of magnitude better than Solar-System experiments, which furthermore can 
only provide local constraints. Thus, LISA-like detectors may provide some hints as to the very 
quantum nature of gravity. 

5.1.3 The quadrupole formula and loss of energy to GWs 

In theories that do not satisfy the strong equivalence principle, the internal gravitational binding 
energies of bodies can create a difference between the inertial dipole moment (i.e., the linear 
momentum, which is conserved) and the GW-generating gravitational dipole moment. Thus, 
alternative theories of gravity generally admit dipole radiation, but it is forbidden in GR, where 
the two moments are identical. Dipole radiation would be given at leading order by [382J 

Vl at i)i \ m{ / 

where Xi^ are the positions of the binary components, v their relative velocity, m\ 2 and mf 2 are 
their inertial and gravitational masses respectively, [i 1 is the inertial reduced mass, and Dl is the 
luminosity distance to the observer. 
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For relativistic objects such as neutron stars, the gravitational binding energy can be con- 
siderable and so can be the resulting loss of energy to dipolar GWs. Indeed, the experimental 
result that the orbital decay of the binary pulsar PSR1913+16 [241] adhered closely to GR's 
quadrupole-formula prediction was sufficient to definitely falsify GR alternatives such as bimetric 
and "stratified" theories [385J. (Amusingly certain theories even predict that dipole radiation 
carries away negative energy from a binary |385| .) Thus it is factually correct to state that the 
indirect detection of GWs has already provided a strong test of GR. 

By contrast, the binary pulsar could not falsify scalar-tensor theories in this way, because these 
are "close" to GR. For instance, although dipole radiation is predicted by Brans-Dicke theory 
and changes the progression of orbital decay, the coupling parameter ojbd can be adjusted to 
approximate GR results to any desired accuracy. GR is reproduced for wdb — > oo, so experimental 
bounds on Brans-Dicke are lower bounds. The binary pulsar does provide a bound on wbdj but 
one that is not competitive with solar-system tests, among which the best comes from the Doppler 
tracking of the Cassini spacecraft, which sets wdb > 40, 000 [3821 I65j . 

LISA-like detectors can constrain ubd by looking for dipole-radiation-induced modifications in 
the GW phasing of binary inspirals (monopole radiation is also present, but suppressed relative to 
the dipole), as long as at least one of the binary components is not a black hole: because of the 
no-hair theorem, black holes cannot sustain the scalar field that would lead to a differing m G and 
m 1 (as was recently confirmed in full numerical-relativity simulations |187| ). Instead, the canonical 
source for detecting this effect is the inspiral of a neutron star into a relatively low-mass central 
black hole, although the number of detections of such systems is likely to be very low |159j . 

Early studies [31711384] , based on simplified models of the waveforms and of the LISA sensitivity, 
estimated that for a 1.4 M@ neutron star inspiraling into an MBH the wbd bounds would scale as 

„ „4 I S \' I 100 \ I T \ 7/s /10 4 M S \ 3/4 

"»> 2x10 (<uj UJ(t^) (-srj • < 29 > 

where S (the "sensitivity") is a measure of the difference between the neutron-star and MBH 
self-gravitational binding energies per unit rest mass; A<I>£> is the dipole contribution to the GW 
phasing; T is the time of observation; and M. is the MBH mass. This estimate however is reduced 
by a factor of ten or more when more realistic waveforms are considered that include spin couplings 
[58l [59], spin-induced orbital precession and eccentricity [391. Bounds can be derived also for a 
massive-scalar variant of Brans-Dicke theory [62] , and are of order 

(where m s is the mass of the scalar, and p the detection SNR) for the intermediate-mass-ratio 
inspiral of a NS into a black- hole with mass < 1O 3 M0. 

These results were obtained using only the leading order correction from the scalar radiation. 
In [399j the authors extended this calculation to all post-Newtonian orders, but in the extreme- 
mass-ratio limit by using the Teukolsky formalism. The conclusion, that constraints on massless 
scalar-tensor theories from GW observations will, in general, be weaker than those from Solar 
System observations, was unchanged. The reason is that scalar-tensor theories are weak-field 
(infrared) corrections to GR and are therefore largest in the weak field, so the leading order 
correction captures the majority of the effect. Massive scalar-tensor theories were also considered 
in [399) . In those theories, the primary observable consequence is the possible existence of "floating 
orbits" at which the scalar flux experiences a super-radiant resonance and temporarily balances 
the GW flux. The transition of an EMRI through such a floating orbit is many orders of magnitude 
slower than the normal EMRI inspiral and can last more than a Hubble time. If an EMRI consistent 
with GR is observed it means that the EMRI not only did not pass through such a floating orbit 
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during the timescale of the observation but could not have encountered one prior to the observation 
since it would not then have reached the milihertz band. An observation of a single EMRI can 
therefore constrain the massive scalar-tensor parameter space to many orders of magnitude greater 
precision than current Solar System observations. 



Other modifications to the inspiral phasing. A number of other suggestions have been made 
for low-frequency-GW tests of GR that do not quite fit a "modified energy-loss" description. For 
instance, dynamical Chern-Simons theory introduces nonlinear modifications in the binary binding 
energy that could be observed in the late inspiral, constraining the characteristic Chern-Simons 
length scale t; 1 / 4 to < 10-f00 km |392j . two to three orders of magnitude better than current 
Solar-system constraints [TS] (advanced ground-based detectors could do even better). 

GW tails, which are due to the propagation of gravitational radiation on the curved background 
of the emitting binary, appear at a relative I .5PN order (c -3 ) beyond the leading-order quadrupole 
radiation, and their observation would test the nonlinear nature of GR [67] . (This would be a null 
test of GR, since tails are included in the "standard" post-Newtonian inspiral phasing; see also the 
PN-coefficient tests discussed in the next section.) 

Promoting Newton's constant, G, to a function of time modifies both a binary's binding energy 
and GW luminosity, and therefore its phasing. A three-year observation of a 10 4 -10 5 M & inspiral 
would constrain G/G to ~ 10 _11 yr _1 402]- The infinite Randall-Sundrum braneworld model 
may predict an enormous increase in the Hawking radiation emitted by black holes. The resulting 
progressive mass loss may be observed as an outspiral effect in the quasi-monochromatic radiation 
of Galactic black-hole binaries, and it would also affect the rate of EMRI events [253J . 



5.2 Tests of general relativity with phenomenological inspiral template 
families 

As discussed above, quantitative tests of GR against modified theories of gravity evaluate how well 
the measured signals are fit by alternative waveform families, or (more commonly) by waveform 
families that extend GR predictions by including one or more modified-gravity parameters, such as 
wwd for Brans-Dicke theory. To set up these tests we need to work within the alternative theory 
to derive sufficiently accurate descriptions of source dynamics, GW emission, and GW propaga- 
tion. An alternative approach is to operate directly at the level of the waveforms by introducing 
phenomenological corrections to general-relativistic predictions: for instance, by modifying specific 
coefficients, or by adding extra terms. 

This section discusses the first attempts to do so. So far these have concentrated on post- 
Newtonian waveforms 68 for circular, adiabatic inspirals, as described by the stationary-phase 
approximation in the frequency domain: 

M/) = A/" 7 / 6 exp + , (31) 

where / is the GW frequency; A is the GW amplitude, given by geometrical projection factors x 
Ai b J 6 ' I Dl (with A4 C = (mim2) 3 / 5 /( m i + W2) 1 ^ 5 the chirp mass and Dl the luminosity distance); 
and for simplicity we omit the nontrivial response of space-based detectors, as well as the PN 
amplitude corrections. The phasing ^S(f) is expanded as 

*(/) = 2nft c + $ c + [l>k + < S log /] /( fe " 5 )/ 3 . (32) 

fc=0 

For binaries with negligible component spins, the post-Newtonian phasing coefficients rpk are cur- 
rently known up to fc = 7 (3.5 PN order), and in GR they are all functions of the two masses m\ 
and m2 alone (although ipi = — ip^ & = 0, and 1/J5 is completely degenerate with <i> c , so it is 
usually omitted). 
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Figure 4: Estimating all the binary-inspiral phasing coefficients tpk of Eq. (32) yields differently 
shaped regions in the m\-mi plane, which must intersect near true mass values if GR is correct 
(from [H])- 



5.2.1 Modifying the PN phasing coefficients 

Arun and colleagues [30] propose a test of GR based on estimating all the ijjk simultaneously from 
the measured waveform as if they were free parameters, in analogy to the post-Keplerian formalism 
[2411 Sec. 4.5]. The value and error estimated for eacrj^jVfcj together with its PN functional form as 
a function of mi and ni2, determines a region in the m\-mi plane. If GR is correct, all the regions 
must intersect near the true masses, as shown in Fig. [4j The extent of the intersection provides 
a measure of how precisely GR is verified by a GW observation. A Fisher-matrix analysis [30] 
suggests that, for systems at the optimistic distance of 3 Gpc, LISA could measure ip to ~ 0.1% 
and 4>2 and tps to 10%, but that the fractional error on higher-order terms would be at best ~ 1. 

This setup, however, may understate the power of this kind of tests, since most of the estimation 
uncertainty in the V'fc arises from their mutual degeneracy — that is, from the fact it is possible to 
vary the value of a subset of ipk without appreciably modifying the waveform. This degeneracy 
should not impact the degree to which the data is deemed consistent with GR. In a follow-up 
paper |31j . Arun and colleagues propose a revised test whereby the masses are determined from 
ipo and ip2, while the other ipk (as well as and 4'e' S ) are individually estimated and checked 
for consistency with GR. In this case, even for sources at z = 1 (~ 7 Gpc), all the parameters 
can be constrained to 1% (a few % for ip4, 0.1% for -03), at least for favorable mass combinations. 
Performing parameter estimation for the eigenvectors of the Fisher matrix |281j indicates which 
combinations of coefficients can be tested more accurately for GR violations. 

However, it is not clear what significance with regards to testing GR should be ascribed to 
the accuracy of measuring the i/^, since we do not know at what level we could expect deviations 
to appear. By contrast, if we were to find that, say, the n-a regions in the m\-mi plane do not 
intersect, we could make the statistically meaningful statement that GR appears to be violated at 
the n-a level. 

4 Because log/ varies weakly in the relevant frequency ranges, it is considered constant in Ref. |30| . and the 
ip)° e log / combinations are absorbed in the tp^ . 
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Table 1: Leading-order effects of alternative theories of gravity, as represented in the ppE frame- 



work (Eq. (331). For GR a = P = 0. (From [103] .) 



Del Pozzo, Li, and colleagues propose a more satisfying formulation for these tests [1171 12361 
1237] , based on Bayesian model selection [173] . which compares the Bayesian evidence, given the 
observed data, for the pure-GR scenario against the alternative-gravity scenarios in which one or 
more of the tpk are modified. The issue of significance discussed above reappears in this context as 
the inherent arbitrariness in choosing prior probabilities for the ipi~i but Del Pozzo and colleagues 
argue that this does not affect the efficacy of the model-comparison test in detecting GR violations. 
(For a comprehensive discussion of model selection in the context of GW detection, rather than 
GR tests, see also Refs. [3701 I3TT1 1255] /) 

5.2.2 The parametrized post-Einstein framework 

In Ref. [401] . Yunes and Pretorius propose a similar but more general approach, labeling it the "Pa- 
rameterized Post-Einsteinian" (ppE) framework. For adiabatic inspirals, they propose enhancing 
the stationary-phase inspiral signal with extra powers of GW frequency: 

/jPPE(/) = ~ h GK (f) x (1 + a^Mff) exp [ip^Mff] (33) 

. While the initial suggestion in [401] is to consider 
a, b G R, there are analytical arguments why a and b should be restricted to values a = a/3 and 
b = 6/3, with (a,b) > (—10,-15) [HI], which reproduces Arun's PN-coefficient scheme for b > 
—5. Nevertheless, this representation can reproduce the leading-order effects of several alternative 
theories of gravity (see Table [I]). 

In Ref. |401j . Yunes and Pretorius are motivated by the possibility of detecting GR violations, 
but also by the "fundamental bias" that would be incurred in estimating GW-source parameters 
using GR waveforms when modified GR is instead correct. In [104] . Cornish and colleagues refor- 
mulate the detection of GR violations described by ppE as a Bayesian model-selection problem, 
similar to the PN-coefficient tests discussed in the previous section. Figure [5] shows the j3 bounds, 
for various fixed 6, that could be set with LISA observations of TOi,2 ~ 10 6 M Q binary inspirals 
at z = 1 and 3. For b corresponding to modifications in higher-order PN terms (which require 
strong-field, nonlinear gravity conditions to become evident), the bounds provided by LISA-like 
detectors become more competitive with respect to solar-system and binary-pulsar results (where 
weak-field conditions prevail). 

[MV: Should the next three paragraphs move to the EMRI section?] An extension of the ppE 
framework to EMRI systems requires a model in which orbits can be both eccentric and inclined. 
To develop this, Vigeland and colleagues [372] derive a set of near-Kerr spacetime metrics that 
satisfy a set of conditions, including the existence of a Carter-constant-like third integral of the 
motion, as well as asymptotic flatness. The solutions, which were previously found in [55], are 
restricted to a physically interesting subset by setting to zero any metric coefficients not required 



where h GK (f) is given in Eqs. plj) and (32 
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Figure 5: Constraints on phasing corrections in the ppE framework, as determined from LISA 
observations of ~ 1O 6 M massive-black-hole inspirals at z = 1 and z = 3. The figure includes also 
the j3 bounds derived from pulsar PSR J0737-3039 [396] , the solar-system bound on the graviton 
mass |350j . and PN-coefficient bounds derived as described in the previous section. The spike 
at b = corresponds to the degeneracy between the ppE correction and the initial GW-phase 
parameter. (Adapted from |104j .) 



to reproduce known black-hole solutions in modified gravity, and by applying the peeling theorem 
(i.e., by requiring that the mass and spin of the black hole not be renormalised by the perturbation). 

The existence of a third integral is not a requirement for black-hole solutions, but in general its 
absence allows ergodic behavior in the orbits. This is discussed as a potential observable signature 
for deviations from GR in Sec. |6.2.5| However, data-analysis pipelines designed for GR waveforms 
may be insensitive to such qualitatively different systems. The existence of a third integral is 
therefore a practical assumption for interpretation once a GR-likc EMRI has been observed. 

In Ref. |156j . Gair and Yunes construct gravitational waveforms for EMRIs occurring in the 
metrics of Ref. [372] . based on the analytic kludge model constructed for GR EMRIs [42]. The 
waveforms provide a ppE-like model for EMRIs that can be used in the same way as the circular 
ppE framework. Parameter-estimation results with these ppE-EMRI models have not yet appeared 
in the literature. 

5.2.3 Other approaches 

In Ref. [366], Vallisncri provides a unified model-comparison formulation of all modificd-GR tests 
that is valid for sufficiently loud signals, and that yields the detection SNR required for a statisti- 
cally significant detection of GR violations as a simple function of the fitting factor FF between the 
GR and modified-GR waveform families. The FF measures the extent to which one can reabsorb 
modified-GR effects by varying standard-GR parameters from their true values. 

An alternative to modifying frequency-domain inspiral waveforms is offered by Cannella and 
coauthors [82] [81]. They propose tests based on the effective-field-theory approach to binary 
dynamics |171j . which expands the Hilbert+point-mass action as a set of Feynman diagrams. In 
this framework, GR corrections can be introduced by displacing the coefficients of interaction 
vertices from their GR values. For instance, multiplying the three-graviton vertex by a factor 
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( 1 + /3 3 ) affects the conservative dynamics of the theory in a manner similar to the PPN parameter 
/3, but has consequences also on radiation. A similar modification to the four-graviton vertex 
(parametrized by ,$4) yields effects at the second post-Newtonian order, so it has no analog in PPN. 
Cannella and colleagues argue that GR- violating values of j3s and /S4 would not be detectable with 
GW signals, but they would instead generate small systematic errors in the estimation of standard 
binary parameters. 

5.3 Beyond the binary inspiral 

According to GR, black-hole mergers are the most energetically luminous events in the Universe, 
with L ~ 10 23 Lq ~ 10 56 erg/s, regardless of mass: at their climax, they outshine the combined 
power output of all the stars in the visible Universe. Nevertheless, second-generation ground-based 
GW interferometers are expected to yield the first detections of black- hole mergers [5], but only 
with rather modest SNRs. By contrast, LISA-like GW detectors would observe the mergers of 
heavier black holes, with SNRs as high as hundreds or more throughout the Universe, offering very 
accurate measurements of the merger waveforms. Massive-black-hole coalescences may feature 
significant spins and eccentricity, further enriching the merger phenomenology 63, 308 . 

The powerful merger events correspond to very relativistic velocities and very strong gravita- 
tional fields, so much that the PN expansion of the field equations cannot be applied, and we must 
resort to very complex and costly numerical simulations |91j . This makes it challenging to encode 
the effects of plausible GR modifications in the signal model. The first ppE paper [401] makes such 
an attempt on the basis of a very crude model of merger-ringdown signals, which would probably 
be insufficient even to phase-match the GR signals themselves. Broad efforts are currently un- 
der way to build phenomenological full- waveform (inspiral-merger-ringdown) models 7, 283, 352 ; 
these involve tunable parameters that are adjusted to match the waveforms produced by numerical 
relativity. Such parameters could be used also to encode non-GR effects in the merger-ringdown. 
At this time, however, designing such extensions in a principled way seems daunting. 

A simpler approach, proposed by Hughes and Menou |202j . involves the golden binaries for 
which system parameters can be estimated from both inspiral and ringdown GWs. The former 
encode the parameters of the binary, while the latter encode the parameters of the final black hole 
formed in the merger. The functional relation of the two sets of parameters can then be compared 
with the predictions of numerical relativity, providing a null test of the strong-field regime of GR. 

Hughes and Menou focus on measuring the remnant's mass deficit, which equals the total 
energy carried away by GWs, so their definition of golden binaries selects those in which the mass 
deficit can be estimated to better than 5%. For LISA, these systems tend to have component 
masses between a few 10 5 M & and a few 10 6 Mq, and to be found at z ~ 2-3, making up 1-10% 
of the total merger rate depending on black-hole population models. The estimates of |202j are 
based on rather simple waveform models that omit a range of physical effects, so they could be seen 
as conservative, given that increased waveform complexity tends to improve parameter-estimation 
accuracy. 
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6 Tests of the nature and structure of black holes 



It has become apparent over the last few decades that the centers of most galaxies harbor massive, 
dark compact objects with masses in the range 1O 6 -1O 9 M0. It is clear that these objects play a 
very important role in the evolution of galaxies. This is exemplified by the very tight measured 
correlation (the M-a relation) between the mass of the central dark objects and the velocity 
dispersion of stars in the central spheroid 140, 357 . It is generally accepted that the central 
dark objects are black holes described by the Kerr metric, but there is presently no definitive 
proof of that assumption. The alternatives to the black hole interpretation include dense star 
clusters, supermassive stars, magnetoids, boson stars, and fermion balls. Support for the black 
hole interpretation has arisen as a result of both observational and theoretical work. A short review 



of the evidence may be found in [90 , although we summarize some key details in Sec. 6.1 

As described in Sec. |2.3| the theoretical basis for the belief that these objects are Kerr black 
holes has arisen from proofs that singularities inevitably form during gravitational collapse [280, 
12841 [185 and that the Kerr solution is the unique stationary and axisymmetric black-hole solution 
in GR [2071 1307j . The uniqueness of the Kerr solution is sometimes referred to as the "no- hair" 
theorem, since the solution is characterized by just two parameters, the black-hole mass M and 
angular momentum (per unit mass) a. 

The field of any vacuum, axisymmetric spacetime in GR can be characterized by a sequence of 
mass and current multipole moments, which we denote as M; and Si respectively |1671 1178] , For 
Kerr spacetimes these multipole moments are all determined by the mass and spin via 

Mi+iSt =M(ia) 1 , (34) 

so these spacetimes require no additional independent parameters or "hair" . The proof of the 
uniqueness theorem relies on various assumptions beyond the validity of the Einstein equations, 
so a demonstration of the non-Kerr nature of astrophysical black holes could reveal exotic physics 
within GR. It might also indicate the presence of material in the spacetime outside the black-hole 
horizon, or a deviation from GR in the true theory of gravity. In this section we discuss the 
potential of space-based low-frequency GW detectors to probe the structure of massive compact 
objects and the possible interpretation of these results. 



6.1 Current observational status 

The observational evidence for the presence of black holes in the centers of galaxies has come mainly 
from the studies of Active Galactic Nuclei (AGN). These are known to be extremely energetic 
and also compact — typical luminosities of 10 46 erg s _1 are produced in regions less than 10 16 m 
across |226j . The inferred AGN efficiency of ~ 10% is much greater than the typical efficiencies 
of nuclear fusion processes (~ 1%), implying the need for a very deep relativistic potential. X-ray 
observations show variability on timescales of less than an hour, while observations of iron lines 
indicate the presence of gas moving at speeds of several thousand km per second [226] . Radio 
observations of water maser discs are consistent with Keplerian motion around very compact 
central objects. In the spiral galaxy NGC 4258 VLBA observations have indicated a disc with an 
inner (detected) edge at ~ 0.1 pc, around an object of mass 3.6 x 1O 7 M |262j . Such compactness 
cannot be realized by a stellar cluster. In addition, about 10% of AGNs are associated with 
jets, which move at highly relativistic velocities and persist for millions of years. This requires 
a relativistic potential that has a preferred axis that is stable over very long timescales. AGNs 
are also remarkably similar over several decades of mass, which favors the black-hole explanation, 
again because Kerr black holes are characterized by just M and a. 

In the Milky Way, evidence for the presence of a black hole coincident with the Sgr A* radio 
source has come from observations of stellar motions. These are completely consistent with Kep- 
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lerian motion around a point source of mass M ~ 4 x 10 6 M [1691 I168| . One star, S2, has been 
observed for one complete orbit and from this it has been possible to put a limit of 0.066 on the 
extended fraction of the central mass that could be contained between pericenter and apocenter 
of the orbit of S2. At perihelion S2 was ~ 100 AU from the central object, which provides a fairly 
tight constraint on its compactness. 

Electromagnetic observations can rule out stellar clusters as the explanation for the massive 
central objects, but some of the exotic alternatives remain. X-ray emission comes from the very 
inner regions of accretion discs, but uncertainties in the radius from which the emission is coming 
and in the mass and spin of the central object limit their utility for probing the structure of the 
central object |302j . It is also possible to construct very compact boson star spacetimes [216J that 
could not be ruled out from electromagnetic observations alone. The same applies to spacetimes 
with a naked singularity. GW observations, by contrast, will probe the spacetime structure as 
the object proceeds through the inspiral and then passes the innermost stable orbit and plunges 
into the horizon of the central object, if a horizon exists. We discuss the prospects for using GW 
observations to probe black-hole structure in the following sections. 



6.2 Tests of black-hole structure using EMRIs 



6.2.1 Testing the "no-hair" property 



Equation (34) tells us that a Kerr black hole is uniquely characterized by two parameters. If we 
can measure three multipole moments of the spacetime, we can check that they are consistent 



with (34). If they are not, then the object cannot be a Kerr black hole. Boson stars are uniquely 



characterized by three multipole moments |313j , so a LISA measurement of four multipole moments 
could exclude these as an explanation. 

GWs from EMRIs are complicated superpositions of different frequency components, found 
at harmonics of the three fundamental frequencies of the orbit: the orbital frequency and the 
frequencies of precession of the perihelion and of the orbital plane [121] . This complex structure 
encodes detailed information about the spacetime in which the GWs are generated. The details 
of this encoding were first worked out by Ryan |311j . By using the Ernst potential p,32 j and 
expressions due to Fodor et al. [148] that relate this potential to the multipole moments of the 
spacetime, Ryan was able to study the properties of orbits in vacuum and axisymmetric spacetimes 
that possess an arbitrary set of mass and current multipole moments. For circular, equatorial orbits 
in these spacetimes, the perihelion and orbital plane precession frequencies can be readily computed 
(for circular and equatorial orbits, these are basically the frequencies of epicyclic oscillations in the 
radial and vertical directions respectively). One finds [311 
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where f2 r and Cl z are the perihelion and orbital plane precession frequencies, and Mi /Si denote 
the mass/current multipole moments of the spacetime metric, as in Eq. (34). The primary con- 
clusion from Eqs. (35)-(36) is that the various multipole moments enter the different terms in 



this expansion at different orders in il. The precession frequencies and orbital frequency could be 
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Table 2: Accuracy with which a LISA observation could determine the multipole moments of a 
spacetime decreases as more multipoles are included in the model, taken from Ryan |312| . The 
third column indicates the highest multipole included in the particular model. Results are shown 
for two typical cases, a 10M Q + 10 5 M Q inspiral and a 1OM + 1O 6 M inspiral; in both cases the 
SNR of the inspiral is 10. 



extracted from GW observations, so these expansions are in principle observable. We can use this 
information to "map" the spacetime structure near the central object and verify that the multipole 



moments are consistent with the no-hair property (34) that we expect for a Kerr black hole. This 
technique is sometimes termed "bothrodesy" or "holiodesy'j^] by analogy with geodesy, in which 
observations of the motion of satellites are used to probe the gravitational field of the Earth. 

The multipole moments are also encoded in the total orbital energy lost as the orbital frequency 
changes by a unit logarithmic interval 
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A more powerful observable than the three discussed so far is the number of cycles that a trajectory 
spends near a particular frequency 

but this is not as clean an observable as the precession frequencies, since it requires computing the 
rate of energy loss to GWs in an arbitrary spacetime. Ryan used this formalism in conjunction 
with a post-Newtonian waveform model to estimate LISA's capability to measure the spacetime 
multipoles |312j . He considered nearly circular and nearly equatorial inspirals, and found that 
LISA's ability to determine the spacetime multipoles degraded as more multipoles were included 
in the waveform model. The typical errors that Ryan found are tabulated in Table [2j The 
conclusion was that LISA would be able to make moderately accurate measurements of the lowest 
three multipole moments, but probably no more. 

Ryan's analysis was restricted to circular and equatorial orbits, but a counting argument sug- 
gests that spacetime mapping should still be possible for generic orbits [235]. One complication is 



5 The term bothrodesy was coined by Sterl Phinney in 2001, arising from the use of the Greek word ftoQpoc; 
(meaning "sacrificial pit") to describe black holes. In modern Greek, fiodpoq has come to mean "sewage pit", so 
holiodesy is a suitable replacement that was first suggested by Marc Favata 1991 . 
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that the evolution of the orbital elements must also be inferred from the observation, which spoils 
the nice form of the expansions (35)-(36) |161| . since all the multipole moments now enter at each 
order of the expansion. However, this would also be true if the expansions for circular-equatorial 
orbits were rewritten as an expansion in some initial frequency, Qq-, which would more closely 
represent a band-limited observation. In practice, the lowest order multipole dominates the lowest 
term in the expansion and so on, which makes spacetime mapping possible in practice. 

The Ryan formalism neatly illustrates why spacetime mapping with EMRIs is possible, but it 
is not a very practical scheme. We expect that the massive central objects are indeed Kerr black 
holes and so we really want to consider what imprint small deviations from Kerr will leave on the 
emitted GWs. A multipolc-momcnt expansion is not a very practical way to do this, as the Kerr 
metric has an infinite number of nonzero multipoles. Several authors have adopted the approach 
of constructing spacetimes given by Schwarzschild-Kerr plus a small deviation, and have examined 
the properties of geodesies in those spacetimes. 

Collins and Hughes [55] considered a static deviation from the Schwarzschild metric. This was 
constructed by writing the metric in Weyl coordinates and adding a quadrupole perturbation to 
the potential (in these coordinates, the potential equation reduces to the flat-space Laplacian for 
the (p, z) cylindrical coordinates, which facilitates writing solutions). They considered two types 
of quadrupole perturbation: a torus around the black hole, and the addition of a point mass at 
each pole. In the second case, the spacetime necessarily contains line singularities running from 
the point masses either to infinity or to the black-hole horizon, which are needed to support the 
point masses. The solutions are perturbative, in that the authors kept only the terms that are 
linear in the deviation from Schwarzschild. Collins and Hughes explored the properties of orbits 
in these spacetimes by comparing the precession and orbital frequencies of equatorial orbits in 
the spacetime to orbits with the same orbital parameters in Kerr. They found that there were 
measurable differences in the perihelion precession in the strong field: for instance, at a radius 
of 20M for a 2% perturbation of the black hole, the trajectory would accumulate one radian of 
dephasing in ~ 1000 orbits. Collins and Hughes coined the term "bumpy black hole" to describe 
spacetimes of this type. 

Glampedakis and Babak [170] took a different approach to study deviations from Kerr. They 
used the Hartle-Thorne metric |181l 1182) to construct a spacetime with metric of the form <? p „ = 
ffjui/" + + 0((5M;>3, 6Si>2), working perturbatively and keeping only the 8M2 perturbation in 
the quadrupole mass moment. They termed the resulting spacetime a "quasi-Kerr" solution. A 
comparison of the frequencies of eccentric equatorial geodesies in the quasi-Kerr spacetime to the 
same geodesies in the Kerr spacetime indicated that it would take only ~ 100 cycles to accumulate 
a 7r/2 phase shift for a ~ 1% deviation from Kerr. They also computed waveform overlaps and 
found that, over the radiation-reaction timescale, the overlap of the waveforms for an orbit with 
p = 10 M was - 20%, 50%, 90%, 98% for inspirals with mass ratio n/M = 10~ 6 , 10~ 5 , 10~ 4 , 10" 3 
respectively. 

A third approach to analyzing deviations from the Kerr spacetime was considered by Gair 
et al. |161j . who studied geodesic motion in a family of exact spacetimes due to Manko and 
Novikov [250J , which include the Kerr spacetime for a specific choice of parameters. By using 
exact solutions of the Einstein field equations, they obtained solutions that were valid everywhere, 
in contrast to the perturbative solutions considered in [991 170] , which break down near the central 
object. However, this scheme offers less control over the multipole moments, since it is possible to 
choose the lowest multipole that differs from Kerr, but then the higher multipoles must al change. 
Gair et al. [161] studied observable properties of the orbits, including the variation of the precession 
frequencies of nearly circular and nearly equatorial orbits as a function of orbital frequency, and 
the loss of the third integral of the motion (see Sec. 6.2.5). 

These three papers outlined different ways to approach the problem of spacetime mapping in 
practice. However, none of the analyses were complete as they did not consider inspirals. Collins 
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and Hughes and Glampedakis and Babak also ignored waveform confusion by assuming that the 
orbital elements were the same between the orbits under consideration. Observationally, the correct 
orbits to compare will be those with the same frequencies since we have no way to determine the 
orbital radius or eccentricity directly. This was the approach adopted in Ref. [1(51] . Assessing the 
confusion problem is relatively easy, but including inspiral is very hard in general, since the presence 
of excess multipole moments in the spacetime leads to changes in the rates of energy and angular 
momentum loss, which must also be included in the analysis. Progress can be made in the presence 
of small deviations by including only the leading-order contributions to the radiation reaction from 
the multipole moments. This is an open area of research, although Barack and Cutler |43j carried 
out a preliminary assessment using post-Newtonian EMRI waveforms [42] augmented with the 
leading-order contribution of an excess quadrupole moment to the precession and inspiral rates. 
The resulting waveforms were an improvement in comparison to Ryan's analysis [312] . since they 
included orbital eccentricity and inclination, and were filtered through an approximation of the 
LISA response. Barack and Cutler performed a Fisher-matrix analysis of parameter-estimation 
uncertainties, and hence correctly accounted for the confusion issue. For this simple model, they 
found that a single LISA observation of the inspiral of a 1OM black hole into a 10 5 5 M Q , 10 6 M Q 
or 10 Mq black hole could measure the deviation from Kerr of the quadrupole moment to an 
accuracy of A(M 2 /M 3 ) ~ 1CT 4 , 1(T 3 , 1(T 2 , while simultaneously measuring the mass and spin 
of the central object to fractional accuracies of 10~ 4 . This suggests that a LIS A- like observatory 
would be able to perform high-precision tests of the no-hair property of massive compact objects 
in galactic centers. To put these numbers in perspective, a boson star may have a quadrupole 
moment that is ~ 100 times that of a Kerr black hole with the same mass and spin [313] . so it 
could be easily excluded. 

6.2.2 Probing the nature of the central object 

During an EMRI, the inspiraling object interacts gravitationally with the horizon of the central 
black hole. This can be thought of as a tidal interaction - the gravitational field of the small 
body raises a tide on the horizon that is dragged around through the orbit, leading to dissipation 
of energy - or as energy being lost by GWs falling into the black hole. Fang and Lovelace |136j 
explored the nature of the tidal-coupling interaction by perturbing a Schwarzschild black hole with 
a distant orbiting moon. They found that the time-dependent piece of the perturbation affected 
the orbit in an unambiguous way: a time-varying quadrupole moment is induced on the black- 
hole horizon that is proportional to the time derivative of the moon's tidal field. This quadrupole 
perturbation extracts energy and angular momentum from the orbit at the same rate that energy 
and angular momentum enter the horizon. However, the effect of the time-independent piece of 
the perturbation remained ambiguous. Working in the Regge- Wheeler gauge, Fang and Lovelace 
found that this piece vanished, in contrast to a previous result by Suen [347], who used a different 
gauge. This ambiguity leads to an ambiguity in the phase of the induced quadrupole moment as 
measured in a local asymptotic rest frame (LARF), although the phase of the bulge relative to 
the orbiting moon is well defined (using a spacelike connection between the moon and the black 
hole, Fang and Lovelace found that the horizon shear led the horizon tidal field by an angle of 
4Mfi, where f2 is the angular velocity of the moon). The ambiguity of interpretation in the LARF 
makes it impossible to define the polarizability of the horizon or the phase shift of the tidal bulge 
in a body-independent way. Fang and Lovelace left open the possibility of developing a body- 
independent language to describe the response of the central object to tidal coupling, but as yet 
this has not happened. 

Although the nature of the response of the central object to tidal coupling may be difficult to 
characterize from GW observations, the total energy lost to tidal interactions is a good observ- 
able. Ryan's original theorem |311j ignored tidal coupling, but it was later generalized by Li and 
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Lovelace [235]. They found that the GWs propagating to infinity depended only very weakly on 
the inner boundary conditions (i.e., on the nature of the central object). This means that the 
spacetime's multipole structure can be inferred from the outgoing radiation field in the usual way, 
and hence the expected rate of orbital energy loss, assuming no energy loss into the central body, 
can be calculated. The rate of inspiral measures the actual rate of orbital energy loss, and the 
difference then gives the rate at which energy is lost to the central object, which is a direct measure 
of the tidal coupling. Li and Lovelace estimated that the ratio of the change in energy radiated to 
infinity due to the inner boundary condition to the energy in tidal coupling scales with the orbital 
velocity as i> 5 . It should therefore be possible to simultaneously determine the spacetime structure 
and the tidal coupling through low-frequency GW observations. 

Information about the central object can also come from the transition to plunge at the end 
of the inspiral. In a black-hole system, we expect GW emission to cut off sharply as the orbit 
reaches the innermost stable orbit and then plunges rapidly through the horizon. If there is no 
horizon in the system, the orbit may instead enter a phase where it passes into and out of the 
material of the central object. This was explored for boson-star models in |216j : Kesden et al. 
found that persistent radiation after the apparent innermost orbit could be a clear signature of the 
presence of a supermassive, horizonless central object in the spacetime. This analysis did not treat 
gravitational radiation or the interaction of the inspiraling body with the central object accurately, 
but it does illustrate a possible way to identify non-black-hole central objects. Something similar 
might happen if the spacetime were to contain a naked singularity rather than a black hole [161] : in 
principle, the nature of the emitted waveform after "plunge" would encode information about the 
exact nature of the central object. However, this has not yet been investigated. Naked-singularity 
spacetimes may have very-high-redshift surfaces rather than horizons: these spacetimes would 
be observationally indistinguishable from black holes, unless the inspiraling object happened to 
move inside the high-redshift surface and then emerged, and the two epochs of radiation could be 
connected observationally. 

6.2.3 Astrophysical perturbations: the influence of matter 

A change in the inspiral trajectory need not be caused only by differences in the central object, 
but might arise due to the presence of material in the spacetime, close to the black hole but 
external to the event horizon. Such material could influence the inspiral trajectory, and hence the 
emitted GWs, in two distinct ways: the gravitational field of the matter could modify the multipole 
moments of the spacetime and hence the orbit as discussed above; if the orbital path intersected 
the material, it would cause sufficient hydrodynamic drag on the object to alter the orbit. 

The influence of the gravitational field of external material was considered in Ref . [37] : Barausse 
et al. constructed a model spacetime that included both a black hole and an external torus of 
material very close to the central black hole. They examined the properties of orbits in two 
systems: one with a torus of comparable mass and spin to the central black hole (spacetime "A"), 
and one with a torus of low mass, but much greater angular momentum than the central black 
hole (spacetime "B"). Their comparisons were based on computing equatorial geodesies and then 
"kludge" gravitational waveforms in the spacetime and in a corresponding Kerr spacetime, and 
then evaluating the waveform overlap. Orbits were identified by matching the radial and azimuthal 
frequencies between the orbits in the two spacetimes in two ways: altering the orbital parameters, 
while setting the mass and spin to be the same in the corresponding Kerr spacetime; or altering 
the mass and spin, while keeping the orbital parameters the same. 

This approach identified a confusion problem: over much of the parameter space the overlaps 
were very high, particularly when the second approach was adopted. Overlaps were lower in 
spacetime "B" than in spacetime "A," and overlaps for "internal" orbits between the black hole 
and the torus were particularly low. This work suggested that it would not be possible to distinguish 
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between such a spacetime and a pure Kerr spacetime in low-frequency GW EMRI observations. 
However, it did not consider inspiraling orbits. The need to constantly adjust the orbital parameters 
in order to maintain equality of frequencies would lead to a difference in the evolution of the orbit 
between the two spacetimes which might break the waveform degeneracies. The torus model was 
also not physical, being much more compact than one would expect for AGN discs. 

The effect of hydrodynamic drag was estimated in [IB]: Barausse and Rezzolla considered a 
spacetime containing a Kerr black hole surrounded by a non-self-gravitating torus with constant 
specific angular momentum. The hydrodynamic drag consists of a short-range part that arises from 
accretion, and a long-range part that arises from the gravitational interaction of the body with the 
density perturbations it causes in the disc. The accretion onto the small object was modeled as 
Bondi-Hoyle accretion [71] [70] and the long-range force using collisional dynamical-friction results 
from the literature [2191 143] . The effect of the hydrodynamical drag on the orbital evolution was 
computed for geodesic orbits and compared to the orbital evolution from radiation reaction for a 
variety of torus models, varying in mass and outer radius. 

The conclusion was that, for realistic outer radii for the torus, r out > 10 4 M, the effect of 
hydrodynamic drag on the orbital radius and eccentricity was always small compared to radiation 
reaction. However, the relative importance increases further from the central black hole. The 
hydrodynamic drag has a greater relative effect on the orbital inclination, and tends to cause orbits 
to become more prograde, which is opposite to the effect of radiation reaction. For r out = 10 5 M, 
the hydrodynamic drag becomes significant at r rs 35 M, but this radius is smaller for a more 
compact torus. Thus, this effect will only be important for LISA if we observe a system with a 
very compact accretion torus, or for systems of low central mass. For the latter, the GWs are 
detectable from orbits at larger radii where hydrodynamic drag can be important. However, the 
SNR of such events will be low, so we are unlikely to see many of them [157] . Thus, although it 
seems that this effect is also marginal, this conclusion is based on considering geodesic orbits, and 
the possible secular build up of a drag signature over the inspiral has yet to be examined, 

The influence of an accretion disc on the evolution of an EMRI embedded within it was explored 
in [3971 1222j . One of the channels that has been suggested to produce EMRIs is the formation of 
stars in a disc followed by the capture of the compact stellar remnants left after the evolution of 
those stars [2U [T!5] . The migration of such an EMRI through the accretion disc could potentially 
leave a measurable imprint on the GW signal. It was shown that over a year of integration, "type-I" 
migration could lead to a ~ 0. 01/10 radian dephasing in an EMRI signal for a//3-type discs, while 
"type-II" migration, happening after a gap opens in the disc, could lead to dephasings of ~ 10/10 3 
radians. The effects are larger for /3 discs, due to their higher surface density. For more massive 
central black holes ~ 10 6 M© the dominant contribution is from Bondi-Hoyle-Lyttleton accretion, 
while for less massive black holes of ~ 10 M@ the dominant contribution is from the migration. 
These dephasings were computed for the same system parameters (apart from maximizing over a 
time and phase shift), so they do not account for possible parameter correlations, but the authors 
argue that the migration dephasing decouples from GW parameters as the effect becomes weaker 
with decreasing orbital radius, while relativistic effects become stronger. The relative number of 
EMRIs that will be produced in discs rather than from other channels is not well understood. 
However, it will be straightforward to identify such EMRIs, which will be circular and equatorial, 
to look for and constrain effects of this type. 

Another important question that has not yet been addressed is how to distinguish the effect 
of external material from a difference in the structure of the central object. If the orbit does not 
intersect the material, such identification would come from the variation in the effect over the 
inspiral — if the change in the multipole structure comes from material, then at some stage the 
object would pass inside the matter, and the qualitative effect on the inspiral would be different 
from that of a central object with an unusual multipole structure. If the orbit does intersect 
the material, then the spacetime-mapping analysis described above no longer applies, since the 
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Geroch-Hansen multipole decomposition [1671 I178j applies to vacuum spacetimes only. If this 
decomposition could be generalized to nonvacuum axisymmetric spacetimes, then low-frequency 
GW observation could potentially recover not only the spacetime metric but also the structure of 
the external material. It would then be possible to verify that this matter obeys the various energy 



conditions (see Sec. 2.3). This is an open area of research at present. 

An independent indicator of the presence of material in the spacetime would come from the 
observation of an electromagnetic counterpart to an EMRI event. For instance, if an inspiraling 
black hole was moving through an accretion disc, there might be emission from the material that 
was accreted onto the inspiraling object or from shocks formed in the disc. Again, this has not been 
explored, although it is likely, given the poor sky-position determination of EMRI events in GW 
observations [42] , that it would not be possible to conclusively identify such a weak electromagnetic 
signature in coincidence with a GW event. 

The presence of exotic matter outside a black hole, in the form of a cloud of axions, was 
discussed in [33] . The presence of large numbers of light axions would be one consequence of extra 
dimensions in string theory, so the detection of an axion cloud would provide strong evidence in 
support of their existence. The axion cloud would modify the motion of an inspiraling black hole 
in a similar way to regular matter, although estimates for the precision of measurements possible 
with future GW detectors have not yet been carried out. The passage of an EMRI through the 
cloud could also lead to its disruption, which may rule out the cloud as a possible explanation 
for any observed deviations, but further theoretical work is required to properly quantify these 
processes. 

The existence of axion clouds can also have other observable GW signatures. The axions in the 
cloud exist in different quantum energy states. If multiple states with the same orbital momenta 
/ and magnetic moments m but different principal quantum numbers n are occupied, transitions 
between these states can generate GWs with characteristic strain 
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for a black hole of mass Mbh at a distance r. The pre factor a 2 (eo£i) 1//2 depends on the axion 
masses and coupling. The axions can also undergo annihilations, which generate GWs with very 
similar characteristic strains 



In both cases the frequency depends on the black- hole mass in the usual way / ~ I /Mbh, with 
typical values for 2Mq black holes of 100 a 3 Hz and 30 a kHz respectively. Both eLISA and LIGO 
could place interesting constraints on the axion parameter space through (non)detections of these 
events [33] . Finally, the self- interactions in the axion cloud could eventually lead to the collapse of 
the cloud in a "bosenova" explosion, which would generate GWs with strain 

h~ 10-- (<) ( ^MP£ (41) 

and frequency ~ c 2 /GMbh- These could also be an interesting source for low- frequency GW 
detectors. 

6.2.4 Astrophysical perturbations: distant objects 

Perturbations to the EMRI trajectory could also arise from the gravitational influence of distant 
objects, such as other stars or a second massive black hole. At present, it is thought to be very 
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unlikely that a second star or compact object would be present in the spacetime sufficiently close 
to the EMRI to leave a measurable imprint on the trajectory |24) . although detailed calculations 
for this scenario have not been carried out. However, if the massive black hole that was the host of 
the EMRI was in a binary with a second massive black hole, this could perturb the trajectory by a 
detectable amount |398) . The primary observable effect is a Doppler shift of the GW signal, which 
arises due to the acceleration of the center of mass of the EMRI system relative to the observer. 
It was estimated that, for typical EMRI systems, the presence of a second black hole within a few 
tenths of a parsec would lead to a measurable imprint in the signal. The frequency scaling of the 
Doppler effect differs from the scaling of the post-Newtonian terms in the unaccelerated waveform, 
which suggests that this effect will be distinguishable in GW observations [398] . The magnitude of 
the leading-order Doppler effect scales as M scc /r 2 , where M soc is the mass of the perturbing black 
hole, and r is its distance. If the second object is within a few hundredths of a parsec, higher-order 
time derivatives could also be measured from the GW observations. These scale differently with 
M sec and r, which would in principle allow the mass and distance of the perturber to be measured 
from the EMRI data [398] . 

The probability that a second black hole would be within a tenth of a parsec of a system 
containing an EMRI is difficult to assess. At redshifts z < 1, at any given time a few percent 
of Milky Way-like galaxies will be involved in a merger, which suggests an upper limit of a few 
percent of EMRIs that could have perturbing companions. However, there is uncertainty as to how 
long the binary black hole will spend at radii of a few tenths of a parsec following a merger, and it 
is plausible that the presence of a second black hole would increase the EMRI rate by perturbing 
stars onto orbits that pass close to the other black hole, introducing an observational bias in favor 
of these systems [398] . Given these uncertainties, the possibility of a distant perturber will have to 
be accounted for in the analysis, and, if it is observed in some systems, LISA-like detectors could 
indirectly inform us the processes that drive massive-black-hole mergers. 

The gravitational influence of a second stellar mass black hole on the evolution of an EMRI 
was considered in [23 . It was shown that a second black hole with a semi-major axis of < 10 _5 pc 
could influence the orbital parameters of an EMRI ongoing in the same galaxy and already in 
the low-frequency GW band. This influence is chaotic, leading to unpredictable evolution of the 
orbital parameters. It was estimated than in 1% of EMRIs a second black hole could be within 
the required distance. However, the timescale of the chaotic motion was ~ 10 9 s, which is much 
longer than a typical low-frequency observation, and the system considered in (23] had an orbital 
period of 6 x 10 3 s, which would probably not yet be detectable by LISA-like detectors. On the 
timescale of a GW mission, the effect would probably manifest itself as a linear drift in the orbital 
parameters, and therefore it is unlikely that this effect would prevent the detection of an EMRI 
signal. A full analysis of the effect on parameter estimation has not yet been carried out. 

6.2.5 Properties of the phase space of orbits 

Loss of the third integral. It was demonstrated by Carter [55] that the Kerr metric has a 
complete set of integrals — in addition to the energy and angular momentum that arise as conserved 
quantities in any stationary and axisymmetric spacetime, geodesies in the Kerr metric conserve 
the Carter constant, Q. This is the analog of the third isolating integral found in some classical 
axisymmetric systems, and has been shown to arise due to the existence of a Killing tensor of 
the spacetime [378j . In the Schwarzschild limit, Q is the sum of the squared angular momentum 
components in the two equatorial directions. The Kerr metric is one of a very special class of metrics 
that have this property. In fact, Carter |88| demonstrated that it was the only axisymmetric metric 
not containing a gravimagnetic monopole for which both the Hamilton-Jacobi and Schrodinger 
equations were separable. The special nature of the Kerr metric was emphasized by Will 383 who 
demonstrated that, in Newtonian gravity, motion about a body with an arbitrary set of multipole 
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Figure 6: Poincare map for a regular orbit (left panel) and a chaotic orbit (right panel) in the 
Manko-Novikov spacetime. Plot reproduced from Gair et al. |161j . 

moments Mi possesses a third integral of the motion only if the multipole moments obey the 
conditions 



which are precisely the conditions satisfied by the mass moments of a Kerr black hole, Eq. ( |34[ ) . 

The separability of the Kerr metric aids the analysis of inspirals in that spacetime, but it also 
suggests another potential observable that would show a deviation from the Kerr metric in a GW 
observation. The specialness of the third integral in the Kerr spacetime suggests that if a spacetime 
differed from the Kerr solution, even by a small amount, the third integral might vanish, which 
would potentially lead to the existence of ergodic orbits. If such orbits were observed it would be a 
clear "smoking gun" for a deviation from the Kerr metric. The existence of ergodic orbits in various 
spacetimes has been explored by several authors. The standard approach is via construction of a 
Poincare: a geodesic is computed in cylindrical coordinates (p,z,<p,6), and the values of p and p 
recorded every time the particle intersects a specified plane z =constant. If the resulting plot of 
all these points on a (p, p) plane yields a closed curve, then a third integral exists, otherwise the 
orbit is ergodic. This is illustrated in Fig. |6.2.5| 

Chaotic motion has been found by Sota et al. for orbits in the Zipoy-Voorhees-Weyl and Curzon 
spacetimes |333j : by Letelier and Viera for orbits around a Schwarzschild black hole perturbed by 
GWs |234j ; by Gueron and Letelier for orbits in a black-hole spacetime with a dipolar halo |175j , and 
in prolate Erez-Rosen bumpy spacetimes [176] ; and by Dubeibe et al. for some oblate spacetimes 
that are deformed generalizations of the Tomimatsu-Sato spacetime |124) . None of these examples 
represented systems that were small deviations from the Kerr metric. The only investigation to 
date of ergodic orbits in the context of LISA was by Gair et al. |161j . who explored geodesic 
motion in a family of spacetimes due to Manko and Novikov [250] that had arbitrary multipole 
moments, but which included Kerr as a special case. Gair et al. [161] considered orbits in a family 
of spacetimes parameterized by a single "excess quadrupole moment" parameter, q, such that q = 
represented the Kerr solution. They found that, while the majority of orbits in these spacetimes 
possessed an apparent third integral, ergodic orbits existed very close to the central object for 
arbitrarily small oblate deformations of the Kerr solution. As the spacetime was deformed away 
from Kerr, a second allowed region for bound geodesic motion was found to appear close to the 
central black hole, in addition to the allowed region present in the Kerr metric. Ergodic orbits 
were found only in this additional bound region. Gair et al. |161j concluded that this ergodic 



M 2 i+i = 



M 2 i = m(M 2 /m) 



for every / 



(42) 



4G 



region would probably be inaccessible to an object that was initially captured at a large distance 
from the central object. This analysis was revisited in |242j but the conclusions in that paper were 
the same. The only difference was that the authors in |242j identified a region of stable motion 
within the inner region that contains chaotic orbits. The chaotic orbit shown in the right hand 
panel of Fig. |6.2.5| appears to pass in and out of the region of stability which should not happen, 
so this might be a numerical artifact. However, the existence of chaotic orbits and the probable 
inaccessibility of these orbits to inspirals was confirmed by [242\. 

Brink [73 also explored integrability in arbitrary stationary, axisymmetric and vacuum space- 
times, concentrating on regions where the Poincare maps indicated the presence of an effective 
third integral. Brink hypothesized that some spacetimes might admit an integral of the motion 
that was quartic in the momentum, in contrast to the Carter constant which is quadratic. This 
hypothesis is as yet unproven. There is also a potential conflict with the example given in [161] . 
The Kolmogorov, Arnold, and Moser (KAM) theorem indicates that when a Hamiltonian system 
with a complete set of integrals is weakly perturbed, the phase space motion will either be confined 
to the neighborhoods of the invariant tori, or the motion will be chaotic [349J . Thus, if there is 
a region of the spacetime where chaotic motion exists, there cannot be another region where a 
third invariant exists. A mathematical demonstration that the orbits can possess an approximate 
invariant, while technically being chaotic, is lacking at present, although this does appear to be 
the case from the numerical calculations |161j . It is also not entirely clear that the perturbation 
can be regarded as "small" everywhere, since the change in mutipole moments necessarily changes 
the horizon structure and so the perturbation is infinitely large at certain points. 

A discussion of the reason for the existence of chaotic geodesies in some spacetimes was given by 
Sota et al. [333] . They suggested that it would arise either from a change in sign of the eigenvalues 
of the Weyl tensor at a point, which would lead to a local "instability," or from the existence 
of homoclinic orbits. The latter explanation applied only to non-reflection-symmetric spacetimes, 
while most spacetimes of astrophysical interest should be reflection symmetric. The Weyl-tensor 
analysis has not been carried out for the Manko-Novikov family of spacetimes [161] . One other 
proposed explanation was the existence of a region of closed timelike curves, which was found to 
touch the region in which ergodic motion was identified |161j . 

In conclusion, it seems unlikely that ergodic geodesies will be found in nature, but if they 
were identified we would know immediately that the spacetime was not Kerr. However, detecting 
chaotic motion from a GW observation is challenging. One possibility would be to observe the 
transition from regular to geodesic motion in a time-frequency analysis: the regular motion would 
be characterized by a few well distinguished peaks in a Fourier transform of the signal, while ergodic 
motion would show a much broader-band structure |161j . However, it is not clear that it would 
be possible to distinguish the ergodic phase from detector noise, and hence there would be no way 
to identify an inspiral that "ends" by entering an ergodic phase as opposed to one which ends at 
plunge into a black hole. If an orbit passed into an ergodic phase and then back into a regular 
phase we might see a signal turn "on" and "off" repeatedly. However, the ergodic motion would 
randomize the phase at the start of the regular motion, so to detect such a signal we would need 
each regular phase to be long enough that they were individually resolvable by matched filtering. 
This would require extreme fine tuning of the system parameters |161j . 

Persistent resonances. Eccentric and inclined extreme-mass-ratio inspirals will generically 
pass through transient resonances at which the radial and 9 frequencies become commensurate. 
For EMRIs in GR, these resonances will be isolated, and the transition through resonance will 
proceed on the usual radiation reaction timescale but with a temporary modification in the energy 
flux on resonance 145 . However, according to the Poincare-Birkhoff theorem, when an integrable 
system is perturbed it causes the appearance of a Birkhoff chain of islands whenever the frequen- 
cies of the system are at resonance. Therefore, in a perturbed Kerr spacetime, another observable 
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consequence would be that the EMRI frequencies could remain on resonance for many more cycles, 
providing another "smoking gun" for a deviation from Kerr ^27\ [243 . Detection of a persistent 
resonance in a matched-filtering search will require a modification of the search pipeline, but it 
should be considerably more straightforward than detection of ergodicity, as the signal will be 
coherent and could therefore be identified using time-frequency methods, or a phenomenological 
waveform model. However, this has not yet been studied in any detail. 

As mentioned in Sec. |5.1.3[ in massive scalar-tensor theories, a different type of persistent reso- 
nance can occur, in which a super-radiant scalar flux balances the GW flux [399J . Such resonances 
can last a significant fraction of a Hubble time and so observing a single EMRI offers significant 
constraining power on the space of massive scalar-tensor theories. This is not a test of black-hole 
structure, since the resonance is between the scalar and gravitational fluxes, rather than in the 
geodesies of the central object, but the observable effects are similar so we mention it here. 



6.2.6 Black holes in alternative theories 

Kerr black holes. The Ryan mapping approach uses observations of precession frequencies as 
functions of the orbital frequency to extract the spacetime metric from gravitational waveforms. 
However, even if the metric is found to be Kerr, this is not enough to verify GR. It was pointed out 
by Psaltis et al. |301j that the Kerr metric is a solution to the field equations for several alternative 
theories of gravity. Essentially, since the Kerr metric has vanishing Ricci tensor, i? M „ = 0, any 
theory in which the vacuum field equations depend only on will also admit Kerr as a solution. 
Allowing for a nonzero cosmological constant, A, a black- hole solution in GR satisfies 

R 

R^ = ^9^, # = 4A, and R»=0, (43) 

in which R denotes the Ricci scalar and is the spacetime metric |301j . Psaltis et al. discussed 
four different alternative theories, which we already described in Sec. |2.1| 



• f(R) gravity in the metric formalism. If f(R) is expanded as a Taylor series ao + R + 
a<iR? + ••', there are three possible cases, (i) If a = 0, which corresponds to R = 0, the Kerr 
solution is always a solution to the equations of motion, (ii) If the Taylor series terminates 
at 02, all constant curvature solutions of GR (with any A) remain exact solutions of the f(R) 
theory, (iii) If ao ^ and the series does not terminate at 02, constant-curvature solutions of 
GR will be solutions of the f(R) theory with different values of the curvature. The difference 
in curvature will be small, however. 

• f(R) gravity in the Palatini formalism. In this case, any constant-curvature solution 
of GR is also a solution to these equations, with the same Christoffel symbols. This is 
unsurprising, as it is known that Palatini f(R) gravity reduces to GR in vacuum |48) . 



• General quadratic gravity. For any black-hole solution in GR, the tensors K K \ and L 



a A 



that appear in the field equations ( 18 1 both vanish, and the field equations reduce to those 



of GR. Hence all black-hole solutions of GR are solutions in this theory. 

• Vector— tensor gravity. In this case, we find once again that all constant-curvature solu- 
tions of GR are solutions to the equations, but with a shifted value of the curvature that 
depends on the vector field strength R = 16A/(4 + (4cj + 3r])K 2 ). 

The action for general quadratic gravity introduced by Stein and Yunes [406) also admits the 
Kerr solution, but only in the nondynamical version of the theory in which the functions fi(9) are 
constants. In that case the field equations are once again satisfied by spacetimes with R a t = 0, 
and so the vacuum solutions of GR are solutions to the field equations in these theories as well. In 
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the dynamical version of the theory, the Riemann tensor enters the field equations explicitly and 
so R a b = is no longer sufficient to satisfy them. We will discuss those black-hole solutions in the 
next section. 

Although it is true that all of these theories admit the Kerr metric as a solution, this does 
not mean that we have no way to distinguish between them via GW observations. This was not 
discussed in [301] . but is argued in a comment on that paper by Barausse and Sotitirou |48j . First, 
the uniqueness theorems of GR [284. f 851 12071 1591 1307] do not necessarily apply in these alternative 
theories. In other words, just because the Kerr metric is a solution does not mean that we would 
expect it to form as a result of gravitational collapse. This is an equally important consideration 
as to what we would expect to see in the Universe, although this argument can be sidestepped 
by suitable fine tuning. In f(R) gravity, the metric of a spherically symmetric body is not the 
Schwarzschild metric, but has a Yukawa correction [S3]. The constraints that LISA could place on 
such a deviation from the Kerr solution were investigated in [56] . Expanding f(R) to quadratic 
order (R + a2R 2 /2), it was found that EMRI observations could place a bound |a.2 1 < 10 17 m 2 , 
about an order of magnitude better than the bound from observations of planetary precession 
in the Solar System, | a-2 1 < 1-2 x 10 18 m 2 . However, the bound from the Eot-Wash laboratory 
experiments is many orders of magnitudes better, \a 2 \ < 2 x 10~ 9 m 2 . 

The GW constraints will be obtained in a very different curvature regime and could be of 
interest if the chameleon mechanism is invoked to say that the effective f(R) might take different 
forms in different regimes. It was subsequently shown that the end state of gravitational collapse 
in f{R) theories (and scalar-tensor theories) is not the point-mass limit of an extended body, but 
is in fact the Kerr solution j336) . The results in [SHj therefore do not apply to black holes in f(R) 
gravity, but would apply for an exotic horizonless compact object if one existed. The constraints 
that low-frequency GW could place on this metric can also be considered to be constraints on a 
strawman Yukawa-like deviation from the Kerr solution, without reference to a specific theory in 
which such corrections arise. 

Another consideration is that the spacetime metric is not the only GW observable. GWs are 
generated by perturbations of the spacetime and hence depend on the full dynamical sector of 
the theory. The response to perturbations will therefore be different if the field equations are 
different, even if the unperturbed metric is the same. In |48j it was demonstrated that, for metric 
f(R) gravity, linearizing about the Minkowski spacetime gives rise to massive graviton modes in 
addition to the standard transverse-traceless modes of GR. These cannot be zeroed out by a gauge 
transformation. To date, no one has considered the generation of such massive modes in a binary 
system nor the observational consequences. However, if these modes are generated, there would 
be two natural ways to find them: a direct GW detection (the modes have different polarization 
states and propagation velocities), and an indirect detection, as the binary system would inspiral 
faster than expected due to loss of additional energy in the extra modes. It is not clear whether 
the latter will be distinguishable from tidal interaction effects, and quantitative estimates of the 
power of tests that would be possible with LISA-like detectors have not yet been done. 

In interpreting GW observations, it will be necessary to verify both the static and dynamic 
aspects of the theory. The Ryan mapping algorithm is the natural way to start, but if a metric is 
found to be consistent with Kerr it will then be necessary to verify that the observed GWs and 
energy loss are also in agreement before concluding that we are observing a system consistent with 
a Kerr black hole in GR. 



Non-Kerr black-hole solutions. Certain alternative theories of gravity do not admit the Kerr 
metric as a solution: these include the dynamical version of general quadratic gravity described by 
Eq. ( 19 ). In general, it is difficult to solve the field equations in alternative theories, so few analytic 
black-hole solutions are known outside of GR. Solutions are known in dynamical Chern-Simons 
gravity [400] . which is a special case of Eq. (19) with a% — a 2 = az = 0, and in Einstein-Dilaton- 
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Gauss-Bonnet (EDGB) gravity |258j . in which gravity is coupled to dilaton and axion fields. In 
the weak-coupling limit, spherically symmetric and stationary solutions to the general quadratic- 
gravity theory described by Eq. (19) take the same form as the solutions to EDGB gravity [406] . 

In dynamical Chern-Simons gravity, it was shown that the metric describing a slowly rotating 
black hole differs from the slow-rotation limit of the Kerr metric beginning at the fourth multipole, 
I = 4 |332| . The energy-momentum tensor of GWs takes the same form as it does in GR, which 
means that the deviation from GR will show up primarily through differences in the geodesic orbits 
in the spacetime. This is completely analogous to spacetime mapping within GR, as described 



in section 6.2.1 so the prospects for detection of such deviations with LIS A- like detectors are 
comparable. Sopucrta and Yunes 332 estimated that IMRI or EMRI events observed by LISA 
would be able to put constraints on the coupling parameter £ of dynamical Chern-Simons gravity 
of £i < 10 2 km. These results were very approximate. A more complete study including a Fisher 
matrix analysis to account for parameter correlations is described in 79, 180] . There it was found 
that LISA observations of EMRIs could place a bound £i < 10 4 km. This is somewhat weaker 
than the bound estimated for IMRIs in |332j . but better than the estimate for EMRIs in that 
paper, so IMRI systems (involving a ~ 10M Q object inspiraling into a ~ 1O 3 M object) may be 
able to place even more stringent constraints than originally estimated, if they were observed. The 
EMRI bound < 10 4 km is four orders of magnitude better than the best Solar System bound, 
which is based on data from Gravity Probe B and LAGEOS satellites |1351 |9"B] . 

In [400] the bound from current binary-pulsar observations was estimated to be four orders of 
magnitude better than that in the Solar System and hence comparable to the expected GW result. 
However, this bound was based on an upper limit on the rate of precession. It was argued in [16] 
that an upper bound could not place a constraint on the CS deviation since the sign of the CS 
contribution is opposite to that in GR, and so a bound could only be placed if a lower bound on 
the precession lying below the GR value could be found. The Solar-System bound is therefore the 
best current constraint, which GW observations will improve by several orders of magnitude. The 
Chern-Simons black- hole metric has recently been derived to quadratic order in the spin [393], so 
the results discussed here can now be extended to this higher order metric, but this has not yet 
been done. 

In the small-coupling limit, it can be shown that the energy-momentum tensor of GWs in 
the general quadratic-gravity theory [Eq. ( 19 1] also follows the same quadrupole formula as in 
GR [3411 . In this more general case, therefore, the primary observable difference will once again 
be in the orbits that the inspiraling compact object follows in the background spacetime of the 
central black hole. For circular-equatorial inspirals, the "post-Einsteinian" parameters [401] for 
the weak-coupling limit of this theory were computed as a = (5/6)£, /3 = (50/3)C, a — 4/3 = b, 
c = —4/5 = d, in which ( is a dimensionless coupling constant that can be used to characterize all 
of these theories. If a GW observation is used to place bounds on these ppE parameters, this can 
therefore be directly translated into a bound on this wide class of alternative theories of gravity. 



As discussed in Sec. 5.2.2 the authors of [372] considered more general possible forms for 
black-hole solutions without specifying a particular alternative theory. By imposing the existence 
of a Carter-like third integral of the motion, asymptotic flatness, and the peeling theorem, and by 
setting as many metric components to zero as possible while still recovering all known modified 
gravity spacetime metrics, they obtained a family of generic modified black-hole solutions. It 
was subsequently realized that all solutions admitting a Carter constant had been found in |55j . 
but [372J identifies the physically realistic subset of these general solutions. This approach is 
suited to the construction of eccentric, inclined inspirals, and in |156j the gravitational waveforms 
for generic EMRIs occurring in these metrics were constructed by extending the "analytic-kludge" 
framework [32] • As yet, these waveforms have not been used to assess the ability of a space-based 
detector to constrain such deviations from the Kerr solution, but work in this area is ongoing. 
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6.2.7 Interpretation of observations 

The previous subsections have identified some of the possible causes of deviations in the structure 
of the central object from the Kerr metric, as well as some observational consequences of these 
deviations. Nevertheless, interpreting GW observations correctly is a nontrivial challenge. Our 
working assumption is that the massive compact objects objects that occupy the centers of most 
galaxies are indeed Kerr black holes. It is therefore reasonable to design an approach to spacetime 
mapping that looks for inspirals into Kerr black holes and quantifies any deviations from such 
inspirals that may be present. One such approach to spacetime mapping was described in |158) . 
The starting point is to assume that GR is correct, and that the source's spacetime is vacuum and 
axisymmetric. The multipole moments can be extracted from the precession frequencies via Ryan's 
algorithm, and then the expected gravitational waveform and inspiral rate for such a spacetime in 
GR can be computed. 

If the observations are consistent there is no evidence to contradict the initial assumptions, but 
we can still do several tests, by asking the following questions: (i) Does the "no-hair" property 
hold for the multipoles? If there is a deviation, the spacetime must either contain closed-timelike 
curves, or lack a horizon, (ii) Does the emission cut off at plunge or does the radiation persist? 
(iii) Is the tidal interaction consistent with a Kerr black hole? If the radiation still cuts off at a 
plunge and the multipole structure is not consistent with the Kerr metric, it might indicate the 
presence of a naked singularity, which would be a violation of the cosmic-censorship hypothesis. 

If the observations are not consistent with GR, the first assumption to relax would be that 
the spacetime was vacuum. In principle, it might then be possible to deduce both the multipole 
moments of the spacetime and the energy-momentum tensor of the matter distribution from the 
observations. The gravitational waveform could then be computed for such a spacetime in GR, and 
checked for consistency with the observations. If the observations are consistent, similar questions 
can be asked: (i) Does the emission cut off at plunge or does the radiation persist? (ii) Is the tidal 
interaction consistent with a Kerr black hole? (iii) Does the matter distribution obey the strong, 
null, and weak energy conditions, or have we identified an exotic matter distribution? Only if the 
observations are inconsistent will there be evidence of a failure in GR itself. 

The main complication to this approach is that the detection of EMRIs will rely on matched 
filtering using waveform templates. If a system differs significantly from a Kerr inspiral, then it 
may not be picked out of the data stream. An alternative approach was discussed by Brink [72\ . 
She suggested that the spacetime-mapping problem could be thought of as analogous to inverse- 
scattering problems in quantum mechanics, where the potential to be determined is the Ernst 
potential |132j . but the technical details of such a method have not yet been worked out. Brink 
also suggested an approach with a similar philosophy to what we discuss above: assuming that 
the instantaneous geodesic is triperiodic (i.e., that it has a complete set of integrals), there is a 
good chance it will have a high overlap with a Kerr geodesic. Each "snapshot" along the inspiral 
thus defines a point in the Kerr geodesic space, and so matched filtering with Kerr snapshots can 
pick out an inspiral trajectory. The inspiral trajectory in the Kerr spacetime can be computed, 
and we can check to see if the observed inspiral deviates from the predicted one. This approach 
has two drawbacks: first, it relies on the GWs from a geodesic in an arbitrary spacetime having a 
high overlap with those from at least one Kerr geodesic; second, the interpretation of deviations 
in the trajectory relies on being able to compute GW emission in alternative metrics in GR, or in 
alternative theories of gravity. However, these will be problems common to all spacetime-mapping 
algorithms. 

In summary, it is clear that low-frequency GW detectors will be able to check that an EMRI is 
consistent with Kerr, which can be interpreted as a statement on how much the spacetime could 
differ from the Kerr metric [221133] and still be consistent with our search templates. What is more 
difficult is to say exactly what LISA will tell us if we observe something that does differ from the 
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Kerr metric. More work is still needed in this area. 
6.2.8 Extreme mass-ratio bursts 

Another class of low-frequency GW sources that are closely connected to EMRIs are the so-called 
Extreme-Mass- Ratio Bursts (EMRBs) [309 . These are the precursors to EMRIs: in the standard 
EMRI formation channel, compact objects are perturbed onto orbits that pass very close to the 
central black hole, emitting bursts of GWs as they do so, which can lead to capture if sufficient 
orbital energy and angular momentum are lost to GWs. Furthermore, just after capture, EMRI 
compact objects proceed on very eccentric orbits that radiate in bursts near pericenter rather than 
continuously. Similar bursts will be generated by "failed" EMRIs — objects that only encounter 
the central black hole a few times before being perturbed onto an unbound or plunging orbit. 

For a sufficiently nearby system (in the Virgo cluster or closer [309] ). these bursts could be 
individually detected by LISA-like detectors. The low SNR of such events compared to EMRIs is 
partially outweighed by the significantly larger number of compact objects undergoing "flybys" at 
a given time, so the detection rate could be as high as 18 yr -1 , of which 15 yr _1 would originate 
in the Milky Way |309| . These rate estimates employed several simplifying assumptions. A more 
careful analysis predicted only ~ 1 event per year [197] . and a proper analysis of our ability to 
identify such bursts in the presence of signal confusion has not been carried out. It is therefore 
unclear at present whether any if these events would be identified in the data of LISA-like detectors. 

It has been suggested |404| that EMRBs could be used to test black-hole structure in a similar 
way to EMRIs. This assertion was based on comparing approximate burst waveforms coming from 
spinning and nonspinning central black holes with the same system parameters. In [57], an analysis 
of parameter estimation for EMRBs accounting for parameter correlations was performed, which 
concluded that a space-based detector like LISA could measure many of the parameters of the 
system to moderate precision, including the mass and spin, provided that the periapse of the burst 
orbit was within ~ lOGM/c 2 . This analysis did not consider sky position or eccentricity to be free 
parameters, taking the former to be the center of the Milky Way, and the latter to be maximal, and 
it treated the distance and compact-object mass as a single amplitude parameter m/D. The small 
object mass cannot be separately determined because there is no significant orbital evolution over 
the duration of one burst. The analysis did not consider constraints on GR through measurements 
of other multipoles, but it found that the mass and spin could both be measured to high accuracy 
for orbits that pass sufficiently close to the black hole, so may be possible to place interesting 
constraints on GR deviations in this way. However, it is unlikely that this will happen in practice, 
since the event rate for such relativistic systems is very low |198j . 

For extra-Galactic systems, the sky position is unknown and so a single burst will not provide 
enough information to allow determination of all the system parameters. If an EMRB was observed 
to "repeat" several times over the lifetime of a GW mission, then more information could be 
derived about the system. However, there are serious practical issues with associating multiple, 
well-separated bursts with the same source. The inspiral orbit might also be perturbed between 
pericenter passages by encounters with other stars. The prospects for doing useful tests of black- 
hole structure with EMRBs therefore seem rather bleak. Moreover, any tests that can be carried 
out will inevitably be much weaker than those based on EMRIs, as an EMRB event will generate 
several waveform cycles rather than the several hundred thousand from an EMRI. 

6.3 Tests of black-hole structure using ringdown radiation: black-hole 
spectroscopy 

When a black hole is perturbed, it rapidly settles back down to a stationary Kerr state via the 
radiation of exponentially damped sinusoidal GWs known as quasi-normal modes (QNMs). For 
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Figure 7: Comparative SNRs, as a function of redshifted black-hole mass (1 + z)M, for the last 
year of inspiral of an equal-mass massive-black-hole binary and for the ringdown after the merger 
of the system. The method used to generate this figure follows that of |147j . updated to use a 
modern LISA sensitivity curve |231j with a low-frequency cutoff of / = 1 x 10~ 4 Hz. The redshift 
is set to z = 1, at which the luminosity distance is =6.6 Gpc using WMAP 7-year parameters 
[224] . 

a Kerr black hole in GW, the QNM frequencies and damping times are uniquely determined by 
the mass and spin of the black hole. Observations of two or more QNMs from the same system 
will therefore test that the end-state of the merger is a Kerr black hole, if they provide consistent 
estimates for the mass and spin of the central object. QNMs will be excited during the merger of 
supermassive black holes and will dominate the late-stage radiation, so LISA-like detectors should 
observe QNM radiation with relatively high SNRs [1461 Q221 EQ] . In fact, the SNR from the QNMs 
alone may be comparable to the total SNR in the inspiral |146| . This is illustrated in Fig. [7j 
which shows the SNR with which LISA would detect the inspiral and the ringdown for equal-mass, 
nonspinning massive-black-hole binaries as a function of black-hole mass. The ringdown dominates 
the SNR for redshifted masses greater than 3 x 1O 6 M . 

Reviews of the theory of quasi-normal modes can be found in Kokkotas and Schmidt [223 
and also in Nollert |276j . but we will briefly summarize the relevant results here. At late times, 
the gravitational radiation from a perturbed black hole can be written as a sum of exponentially 
damped sinusoids. It is known that QNMs do not form a complete set in a mathematical sense, but 
numerical results confirm that the late-time behavior is well described by such an expansion |60) . 
The angular dependence can be decomposed into a sum of spheroidal harmonics Si m of spin 
weight 2, which are labelled in analogy to standard spherical harmonics. For each (l,m), there are 
infinitely many resonant QNMs, which can be labelled in order of decreasing damping time by a 
third parameter n. The waveform expansion takes the form 

h+ +ih x = ^-Y. ^e i(u, ™ H *"» ) e-V^Simn, (44) 

Iran 

where u>i mn , Ti mn , Ai mn , and (f>i mn are the frequency, damping time, amplitude, and initial phase of 
the mode, and Si mn is a complex number obtained by evaluating the spheroidal harmonics for the 
particular orientation and mode frequency. The frequency and damping time, u>i mn and r;„ m , are 
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determined solely by the mass and frequency of the central black hole and are the key observables. 
The damping time can also be considered as the reciprocal of the imaginary part of a complex 
mode frequency, The amplitude and phase depend on the exact nature of the perturbation that 
excited the QNMs, so they are less useful. One complication is that the mode numbers that are 
excited [i.e., the (l,m,n) parameters] are not known a priori. In the first analysis of SNR from 
QNMs |146j . it was assumed that only the I = m = 2, n = was excited, which simplifies analysis 
of the observations. However, this simplifying assumption is not needed in general. 

The use of QNMs as a probe of black- hole structure was first proposed by Dreyer et al. |122) . 
who coined the phrase "black-hole spectroscopy" for this technique. A measurement of u>i mn 
and Ti mn for a single QNM will give several discrete solutions for the black-hole mass and spin, 
corresponding to different choices for the mode numbers. A measurement of a second QNM will 
give another discrete set of values, and in general these will not be the same as the first set except 
for one combination, which are the true values |122| . This not only determines the black- hole 
parameters, but provides a test that the central object is a Kerr black hole, since the relationship 
between black-hole parameters and frequencies depends on the fact that assumption. If the object 
was something else, we would find no consistent combination of mass and spin parameters to arise 
from the analysis. Dreyer et al. |122) considered a frequentist approach to analysing the QNM 
problem. The observable is a set of N complex QNM frequencies. We denote a set of choices for 
the N modes to which the frequencies correspond by Q. Each Q defines a two-dimensional curve 
in the 2N + 2 dimensional space, parameterized by (a, M). The probability distribution for the 
observed frequencies, u, may be denoted by P(u>\a,M, Q) and can be used to define a confidence 
interval by determining the pg such that 



Dreyer et al. |122] deemed that an observation was inconsistent with GR if the actual observed 
frequencies lay outside such confidence intervals for all possible choices of a, M and Q. Using a 
toy model for a non-black-hole source, and assuming that two QNMs were observed, drawn from 
four possible excited modes, they estimated that, with a false alarm probability of 1%, the false 
dismissal rate would be 60% if the SNR in the weakest mode was 10, falling to 10% at SNR of 
100. Assuming a reasonable amount of energy deposition into the QNMs, they estimated that this 
would be satisfied by all LISA massive-black-hole merger sources, so the prospects for such tests 
are good. 

Berti et al. [60] took a more detailed look at QNMs in general. They computed SNRs that 
improved on those in 146 by using an up-to-date detector noise curve and including black-hole 
spin. For a source of mass 10 5 M Q < M < 10 7 Af Q at a distance of 3 Gpc, they found SNRs between 
10 and 2 x 10 4 for an excitation energy of 3% of the rest mass, and between 1 and 3000 for an 
excitation energy of 0.1%. There is little dependence of the SNR on the central black-hole spin. 
Putting the source at higher z shifts the sensitive range of masses in the usual way, but even at 
redshift z = 10 the maximum SNR was found to be several hundred. Berti et al. [60] also looked 
at parameter-estimation accuracy from observations of one QNM (with known mode numbers) 
and found that LISA observations would measure the mass and spin to fractional accuracies in 
the range 10~ 2 -10~ 5 for sources at D — 3 Gpc and for the masses as quoted above. Applying 
these results naively to the measurement of two mode frequencies suggests that we should be more 
pessimistic, since the second mode will be suppressed by a factor of 10-100 compared to the first, 
with correspondingly enhanced error. 

Berti et al. |60| performed also a Fisher-matrix analysis of a two-mode problem, which accounted 
for all correlations between the modes. They considered both "pseudo-orthonormal" modes with 
different angular dependence ^ ( or m' ^ m) and "overtones" — that is, modes with the same 
angular dependence (/' = I and m' = m). In both cases, the accuracy of determination of the mass 
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(45) 
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and spin was comparable to the results quoted for a single mode, although the parameter space was 
simplified significantly to just five parameters: a, M, two mode amplitudes Ai, A2, and a common 
initial phase 4>. Berti et al. define two distinct QNMs to have resolvable frequencies if difference 
in these is larger than the error in both (and similarly for the damping times). By recasting the 
two-mode model in terms of the two mode frequencies and damping times rather than a and M, 
resolvability can be examined by looking at the Fisher-matrix errors. Berti et al. |60j considered 
two cases (the resolvability of either frequencies or damping times, and the resolvability of both), 
and they computed the source SNR needed in each case. When the two modes are overtones, the 
SNR required to resolve "cither" ranged from 75 to 125, depending on the black-hole spin and 
the (l,m) numbers of the mode, but the SNR required to resolve "both" ranged from 500 to 1500. 
When the two modes are "pseudo-orthonormal," the SNR range to resolve "either" is ~ 1-10 if 
the modes differ in I, and ~ 5 — 50 is the modes have the same I but differ in m and the central 
blackhole spin is a > 0.4. There is a mode degeneracy, which means the SNR required blows up 
when I = V and a — > 0. The SNRs required to resolve "both" modes are in the range 100-10000. 

Berti et al. (60j conclude that even under pessimistic assumptions about the amount of energy 
radiated in ringdown radiation, it should be possible to resolve one QNM and either the damping 
time or frequency of a second QNM, provided that the first overtone radiates ~ 10~ 2 of the total 
ringdown energy. This will provide enough for a test of the no-hair property of the central object. 
A stronger test would come from detecting frequencies and damping times for both QNMs, but 
this would require ringdown SNRs ~ 1000, which is rather unlikely. The main uncertainty in 
this analysis is in the excitation coefficients for the various modes, but these should be better 
determined by numerical relativity in the near future. 

In [212] . the authors used numerical-relativity simulations of nonspinning black-hole binaries, 
with a variety of mass-ratios ranging from 1:1 to 1:11, to compute the amplitude to which several 
different ringdown modes were excited, and hence an estimate of the SNR with which LISA would 
be able to observe them. For all the mass ratios considered, the (3,3) mode was found to radiate 
between 2% and 20% of the energy of the (2, 2) mode, and for mass ratios more unequal than 1:2 
the (4, 4) also radiated more than 1% of the energy of the (2, 2) mode. For sources at a reshfift of 
z = 1, they estimated a total ringdown SNR between 1000 and 20000 for black- hole masses in the 
range 1O 6 -1O 8 M0 and mass ratios from 1:1 to 1:25, with individual SNRs in the (2, 2), (2, 1), and 
(3,3) modes between several hundred and 10000. The mode SNRs do not add in quadrature, but 
these SNRs more than meet the requirements of j60| for LISA to be able to carry out black-hole 
spectroscopy. The SNRs realizable with eLISA will be factors of a few lower and the systems will 
tend to have lower masses, for which the intrinsic SNRs are also smaller. The constraints from 
eLISA will therefore be weaker and it is possible that eLISA will not be able to resolve sufficient 
separate modes, but this has not yet been explored. 

Space-based QNM observations could be used directly to put constraints on alternative theories 
of gravity, but this will require a calculation of the QNMs in those. It was shown in 403 that the 
equations governing black-hole perturbations in dynamical Chern-Simons gravity were different 
from those in GR, with the consequence that the QNM spectrum would also be different. The 
authors did not compute QNMs in that theory, however, and so it is not yet clear at what level 
ringdown radiation could constrain the Chern-Simons coupling parameter. 

If a LISA-like observatory observes a massive-black-hole inspiral and does not detect QNM 
ringing from the merger remnant where radiation would be expected, this might indicate a violation 
of the cosmic-censorship hypothesis. QNM ringing arises as a result of GWs becoming trapped near 
the horizon of the black hole. If a horizon was absent, for instance if a super-extremal (a/M > 1) 
Kerr metric was formed, we would not expect to observe the QNM radiation. This would be 
evidence for the existence of a naked singularity, and a counterexample to cosmic censorship, 
although not an indication of a problem with GR. 
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6.4 Prospects 

Although there are still many open questions, current research indicates clearly that low-frequency 
GW detectors will be able to make strong statements about the structure of the massive compact 
objects in the centers of galaxies by observing EMRIs. At the very least, it will be possible 
to test that an observation is consistent with an inspiral into a Kerr black hole, and to state 
quantitatively how large a deviation from Kerr could have been masked by instrumental noise. If 
a system differed significantly from Kerr, we would identify the deviation and should be able to 
quantify its nature. How well we would be able to distinguish between different types of deviation 
(e.g., external matter vs. a different multipole structure of the central object, is still not totally 
clear, but there are ongoing efforts in this direction. 

Low-frequency GW detectors will provide information on black-hole structure to a much higher 
precision than can be achieved by other techniques. It has been suggested that observations of 
intermediate mass ratio inspirals (IMRIs) with Advanced LIGO will have some power to do similar 
tests [74 . This data will be available five or more years prior to any space-based experiment. 
However, IMRI-based tests will be much weaker, since many fewer cycles of the GWs will be 
detected in a single observation due to the larger mass ratio (77 ~ 10 _2 -10 _1 ); furthermore, event 
rates are highly uncertain [249] . An IMRI observation would be able to detect a deviation from 
Kerr of fractional order unity in the quadrupole moment of a source [71], compared to the ~ 10 -3 
fractional accuracies achievable with LISA EMRIs . 

Observations in the electromagnetic spectrum can also probe black hole structure. Psaltis [300] 
provides a thorough review of tests of strong-field gravity using electromagnetic observations. The 
most relevant tests of black-hole structure are as folllows. 

• Horizon detection. X-ray interferometers (e.g., the Black Hole Imager) will soon have the 
angular resolution to directly image the horizon of extragalactic black holes at distances of 
~ 1 Mpc [300J . This is already barely possible with sub-mm/infrared observations. The 
accretion flow for the Milky Way black hole, Sgr A*, can already be imaged directly down to 
its innermost edge. Interpretation of the observations is complicated, however, by the need 
to simultaneously constrain the disc properties. Observations at multiple wavelengths will 
be required to fit out the disc and directly image the shadow of the black hole on the disc. 

Evidence for the existence for horizons also comes from observations of quiescent X-ray 
binaries. The systems that are thought to possess black holes are systematically less luminous 
than the systems thought to contain neutron stars. This is believed to result from advection 
of energy through the black-hole horizon [300J . 

Electromagnetic observations will really indicate the existence of a high-redshift surface in 
the system, and not necessarily an actual horizon. If a system contained a naked singularity 
with a high-redshift surface, but not a true event horizon, this would not be evident from 
the electromagnetic observations alone. By probing the multipole structure and verifying 
consistency with the no-hair theorem, LISA-like detectors will go much further. 

• ISCO determination. The highest-temperature emission from a disc comes from the in- 
nermost stable circular orbit, and the flux at that temperature is proportional to the ISCO 
radius squared. This allows the determination of the spin of the black hole. If the inferred 
spin was found to exceed one, this might indicate failure in the black-hole model. Such spin 
determinations have typical errors of ~ 10%, much greater than LISA's expected errors of 
~ 0.01% |42]. 

• Accretion disc mapping. Particles in the accretion disc around a black hole move on 
circular geodesies of the metric. If the orbits in the disc could be mapped, this would allow 
spacetime mapping along the lines of Ryan's theorem |311j . Two possible probes of accretion 
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disc structure have been identified: quasi-periodic oscillation (QPO) pairs and iron Ka lines. 
QPOs have been used to constrain possible values of black-hole masses and spins, but there 
are uncertainties in the radius at which they originate, and in the resonance that gives rise 
to the pairs of lines. Iron lines show broadening due to differential gravitational redshift 
and Doppler shift at different points in the disc, but again their interpretation depends on 
unknown details of the disc geometry. In principle, the time variability of an iron line after a 
single flare event could constrain both the geometry and map the disc. Future observations 
with instruments such as 1X0/ ATHENA will have the time-resolution to attempt this |300j . 

Electromagnetic observations are generally hampered by a lack of knowledge of the physics of the 
material that is generating the radiation. GW systems are, by contrast, very "clean" since the same 
theory describes the spacetime and the radiation generation. While there is some hope that future 
electromagnetic observations will perform crude spacetime mapping, LISA EMRI observations will 
improve any previous constraints by orders of magnitude. 
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7 Discussion 



Although the bibliography of this review lists ~ 400 references, fundamental tests of gravity remain 
the least explored sector of GW astronomy. It is also one where new discoveries may be hardest to 
come by — even if they would be momentous if they do. Space-based gravitational-wave detectors 
will improve certain existing constraints on alternative theories; even more important, they will 
provide novel and unique opportunities to precisely characterize the poorly explored nonlinear, 
dynamical sector of gravitation, as well as the properties of gravitational radiation fields. These 
measurements will provide a definite confirmation that Einstein's theory, born from insight and 
beauty, applies in the most extreme regimes; or they may expose new phenomena, leading to new 
models for gravitational physics. 

The research that we have reviewed spans a broad assortment of GW observations, which 
probe many disparate aspects of gravitational phenomenology. Some of these investigations have 
yet to be refined to the same level of formal rigor as other, more astrophysical applications of 
GW astronomy. Nevertheless, they paint an exciting picture of the expected fundamental-physics 
payoff of space-based detectors. We expect that this will remain an active and intriguing research 
area for many years: thus, it is appropriate that this review should be living, and we welcome the 
suggestions of our readers in improving it and keeping it up to date. 
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